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Preface 



This volume contains the papers selected for presentation at IPCO VI, the Sixth 
International Conference on Integer Programming and Combinatorial Optimiza- 
tion, held in Houston, Texas, USA, June 22-24, 1998. The IPCO series of confer- 
ences highlights recent developments in theory, computation, and applications 
of integer programming and combinatorial optimization. 

These conferences are sponsored by the Mathematical Programming Society, 
and are held in the years in which no International Symosium on Mathemati- 
cal Programming takes place. Earlier IPCO conferences were held in Waterloo 
(Canada) in May 1990; Pittsburgh (USA) in May 1992; Erice (Italy) in April 
1993; Copenhagen (Denmark) in May 1995; and Vancouver (Canada) in June 
1996. 

The proceedings of IPCO IV (edited by Egon Balas and Jens Clausen in 
1995) and IPCO V (edited by William Cunningham, Thomas McCormick, and 
Maurice Queyranne in 1996), were published by Springer- Verlag in the series 
Lecture Notes in Computer Science as Volumes 920 and 1084, respectively. The 
proceedings of the first three IPCO conferences were published by organizing 
institutions. 

A total of 77 extended abstracts, mostly of an excellent quality, were initially 
submitted. Following the IPCO policy of having only one stream of sessions over 
a three day span, the Program Committee selected 32 papers. As a result, many 
outstanding papers could not be selected. 

The papers included in this volume have not been refereed. It is expected 
that revised versions of these works will appear in scientific journals. 

The Program Committee thanks all the authors of submitted extended ab- 
stracts and papers for their support of the IPCO conferences. 
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The Packing Property * 



Gerard Cornuejols^, Bertrand Guenin^, and Frangois Margot^ 



^ Graduate School of Industrial Administration 
Carnegie Mellon University, Pittsburgh, PA 15213, USA 
^ Department of Mathematical Science 
Michigan Technical University, Houghton, MI 49931, USA 



Abstract. A clutter (V,E) packs if the smallest number of vertices 
needed to intersect all the edges (i.e. a transversal) is equal to the maxi- 
mum number of pairwise disjoint edges (i.e. a matching). This terminol- 
ogy is due to Seymour 1977. A clutter is minimally nonpacking if it does 
not pack but all its minors pack. A 0,1 matrix is minimally nonpacking if 
it is the edge- vertex incidence matrix of a minimally nonpacking clutter. 
Minimally nonpacking matrices can be viewed as the counterpart for the 
set covering problem of minimally imperfect matrices for the set packing 
problem. This paper proves several properties of minimally nonpacking 
clutters and matrices. 



1 Introduction 

A clutter C is a pair (V{C), E{C)), where V{C) is a finite set and E{C) = 
{Si, . . . , Sm} is a family of subsets of V(C) with the property that Si C Sj 
implies Si = Sj. The elements of V(C) are the vertices of C and those of E{C) 
are the edges. A transversal of C is a minimal subset of vertices that intersects 
all the edges. Let t(C) denote the cardinality of a smallest transversal. A clutter 
C packs if there exist r(C) pairwise disjoint edges. 

For j G V{C), the contraction C/ j and deletion C \ j are clutters defined as 
follows: both have V{C) — {j{ as vertex set, E{C/j) is the set of minimal elements 
of {S' — {j} : S G E(C)} and E(C \ j) = {S : j ^ S G E(C)}. Gontractions and 
deletions of distinct vertices can be performed sequentially, and it is well known 
that the result does not depend on the order. A clutter T> obtained from C by 
deleting Id QV (C) and contracting Ic Q V (C), where IcHld = 0 and IcUld yf 0, 
is a minor of C and is denoted by C \ Id/Ic- 

We say that a clutter C has the packing property if it packs and all its minors 
pack. A clutter is minimally non packing (mnp) if it does not pack but all its 
minors do. In this paper, we study mnp clutters. 

* This work was supported in part by NSF grants DMI-9424348, DMS-9509581, ONR 
grant N00014-9710196, a William Larimer Mellon Fellowship, and the Swiss National 
Research Fund (FNRS). 

R. E. Bixby, E. A. Boyd, and R. Z. Rios-Mercado (Eds.): IPCO VI 
LNCS 1412, pp. 1-8, 1998. © Springer-Verlag Berlin Heidelberg 1998 
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These concepts can be described equivalently in terms of 0,1 matrices. A 0,1 
matrix A packs if the minimum number of columns needed to cover all the rows 
equals the maximum number of nonoverlapping rows, i.e. 

n 

min I Xj : Ax > e, 

1 

m 

= max {'^yi-.yA<e, 

1 

where e denotes a vector of appropriate dimension all of whose components 
are equal to 1. Obviously, dominating rows play no role in this definition (row 
Ai, dominates row Ak., fc yf t, if Aij > Akj for all j), so we assume w.l.o.g. 
that A contains no such row. That is, we assume w.l.o.g. that A is the edge- 
vertex incidence matrix of a clutter. Since the statement “A packs” is invariant 
upon permutation of rows and permutation of columns, we denote by A(C) 
any 0,1 matrix which is the edge-vertex incidence matrix of clutter C. Observe 
that contracting j G V{C) corresponds to setting Xj = 0 in the set covering 
constraints A(C)x > e (since, in A{C/j), column j is removed as well as the 
resulting dominating rows), and deleting j corresponds to setting Xj = 1 (since, 
in A{C \ j), column j is removed as well as all rows with a 1 in column j). The 
packing property for A requires that equation (1) holds for the matrix A itself 
and all its minors. This concept is dual to the concept of perfection (Berge [1]). 
Indeed, a 0,1 matrix is perfect if all its column submatrices A satisfy the equation 

n 

max { ^ Xj : Ax < e, x G {0, 1}^} 

1 

m 

= min{ : yA > e, yG{0, 1}™}. 

1 

This definition involves “column submatrices” instead of “minors” since setting 
a variable to 0 or 1 in the set packing constraints Ax < e amounts to con- 
sider a column submatrix of A. Pursuing the analogy, mnp matrices are to the 
set covering problem what minimally imperfect matrices are to the set packing 
problem. 

The 0,1 matrix A is ideal if the polyhedron {x > 0 : Ax > e} is integral 
(Lehman [9]). If A is ideal, then so are all its minors [16]. The following result 
is a consequence of Lehman’s work [10]. 

Theorem 1. If A has the packing property, then A is ideal. 

The converse is not true, however. A famous example is the matrix Qq with 
4 rows and 6 columns comprising all 0,1 column vectors with two O’s and two 
I’s. It is ideal but it does not pack. This is in contrast to Lovasz’s theorem [11] 
stating that A is perfect if and only if the polytope {x > 0 : Ax < e} is integral. 



XG{0,1}”} 

2/e{o,ir}, (1) 
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The 0,1 matrix A has the Max-Flow Min-Cut property (or simply MFMC 
property) if the linear system Ax > e, a; > 0 is totally dual integral (Seymour 
[16]). Specifically, let 

t{A, w) = min {wa; : Ax > e, x € {0, 1}”}, 

m 

v{A,w) =ma,x{'^yi:yA<w, yG{0, 1}™}. 

1 

A has the MFMC property if t{A,w) = v{A,w) for all w G Z”. Setting Wj = 0 
corresponds to deleting column j and setting Wj = +oo to contracting j. So, 
if A has the MFMC-property, then A has the packing property. Conforti and 
Cornuejols [3] conjecture that the converse is also true. 

Conjecture 1. A clutter has the packing property if and only if it has the MFMC 
property. 

This conjecture for the packing property is the analog of the following version of 
Lovasz’s theorem [11]: A 0, 1 matrix A is perfect if and only if the linear system 
Ax < e, a: > 0 is totally dual integral. 

In this paper, our first result is that this conjecture holds for diadic clutters. 
A clutter is diadic if its edges intersect its transversals in at most two vertices 
(Ding [6]). In fact, we show the stronger result: 

Theorem 2. A diadic clutter is ideal if and only if it has the MFMC property. 

A clutter is said to be minimally non ideal (mni) if it is not ideal but all 
its minors are ideal. Theorem 1 is equivalent to saying that mni clutters do not 
pack. Therefore mnp clutters fall into two distinct classes namely: 

Remark 1. A minimally non packing clutter is either ideal or mni. 

Next we consider ideal mnp clutters. Seymour [16] showed that Qq is the only 
ideal mnp clutter which is binary (a clutter is binary if its edges have an odd 
intersection with its transversals). Aside from Qq, only one ideal mnp clutter 
was known prior to this work, due to Schrijver [14]. We construct an infinite 
family of such mnp clutters (see Appendix). The clutter Qq, Schrijver’s example 
and those in our infinite class all satisfy r(C) = 2. Our next result is that all 
ideal mnp clutters with r(C) = 2 share strong structural properties with Qq. 

Theorem 3. Every ideal mnp clutter C with t{C) = 2 has the Qq property, i.e. 
A{C) has 4 rows such that every column restricted to this set of rows contains 
two O’s and two I’s and, furthermore, each of the 6 such possible 0,1 vectors 
occurs at least once. 

We make the following conjecture and we prove later that it implies Conjecture 1. 
Conjecture 2. If C is an ideal mnp clutter, then t(C) = 2. 
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The blacker b{C) of a clutter C is the clutter with V{C) as vertex set and the 
transversals of C as edge set. For Id, Ic C V{C) with fl /c = 0, it is well known 
and easy to derive that b{C \ Id/Ic) = b{C)/Id \ Ic- 

We now consider minimally non ideal mnp clutters. The clutter Ht, for t > 2 
integer, is given by V{Jt) = and E{Jt) = {1, . . . , t}, {0, 1}, {0, 2}, . . . , 

{0,t}. Given a mni matrix A, let x be any vertex of {a; > 0 : Ax > e} with 
fractional components. A maximal row submatrix A of A for which Ax = e is 
called a core of A. The next result is due to Lehman [10] (see also Padberg [13], 
Seymour [17]). 

Theorem 4. Let A be a mni matrix, B = 6(A), r = t{B) and s = t{A). Then 

(i) A (resp. B) has a unique core A (resp. B). 

(ii) A, B are square matrices. 

Moreover, either A = A{Jt), t > 2, or the rows and columns of A can be 
permuted so that 

(Hi) AB^ = J + (rs — n)I. 

Here J denotes a square matrix filled with ones and / the identity matrix. Only 
three cores with rs = n + 2 are known and none with rs > n + 3. Nevertheless 
Cornuejols and Novick [5] have constructed more than one thousand mni matri- 
ces from a single core with rs = n + 2. An odd hole is a clutter with fc > 3 
odd, P(C|) = k} and E{Ci) = {{1, 2}, {2, 3}, . . . , {A: - 1, k}, {k, 1}}. Odd 
holes and their blockers are mni with rs = n -I- 1 and Luetolf and Margot [12] 
give dozens of additional examples of cores with rs = n -I- 1 and n < 17. We 
prove the following theorem. 

Theorem 5. Let A A{Jt) be a mni matrix. If A is minimally non packing, 
then rs = n -I- 1. 

We conjecture that the condition rs = n -I- 1 is also sufficient. 

Conjecture 3. Let A A(^Jt) be a mni matrix. Then A is minimally non packing 
if and only if rs = n -I- 1 . 

Using a computer program, we were able to verify this conjecture for all known 
mni matrices with n < 14. 

A clutter is minimally non MFMC if it does not have the MFMC property 
but all its minors do. Conjecture 1 states that these are exactly the mnp clutters. 
Although we cannot prove this conjecture, the next proposition shows that a 
tight link exists between minimally non MFMC and mnp clutters. The clutter 
V obtained by replicating element j G V(C) of C is defined as follows: V{T>) = 
V{C) U {j'} where j' ^ V{C), and 

E{V) = E{C) U {5 - {j} U {/} :jGSG E{C)}. 

Element f is called a replicate of j. Let ej denote the unit vector. 

Remark 2. V packs if and only if r(C, e -I- Cj) = v{C, e -I- Cj). 
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Proposition 1. Let C he a minimally non MFMC clutter. We can construct a 
minimally non packing clutter T> by replicating elements ofV{C). 

Proof. Let w G Zf: he chosen such that t(C, w) > v{C, w) and r(C, w') = v{C, w') 
for all w' e Zf: with w' < w and w' < wj for at least one j. Note that wj > 0 
for all j, since otherwise some deletion minor of C does not have the MFMC 
property. Construct T> by replicating Wj — 1 times every element j G V{C). By 
Remark 2, T> does not pack. Let V = 'D\Id/Ic be any minor of T>. If j or one of 
its replicates is in then we can assume that j and all its replicates are in Ic. 
Then V is a replication of a minor C oiCj j. Since C has the MFMC property, 
V packs by Remark 2. Thus we can assume Ic = %. By the choice of w and 
Remark 2, if /d yf 0 then V packs. □ 

Proposition 1 can be used to show that if every ideal mnp clutter C satisfies 
t(C) = 2 then the packing property and the MFMC property are the same. 

Proposition 2. Conjecture 2 implies Conjecture 1. 

Proof. Suppose there is a minimally non MFMC clutter C that packs. By The- 
orem 1, C is ideal. Then by Proposition 1, there is a mnp clutter T> with a 
replicated element j. Furthermore, T> is ideal. Using Conjecture 2, 2 = t{T>) < 
T{V/j). Since V/j packs, there are sets S'i,S '2 G E(fD) with Si H S 2 = {j}. 
Because j is replicated in T>, we have a set S'j = S'! U {f} — {j}. Remark that 
f ^ S 2 . But then 5^ n 52 = 0, hence T> packs, a contradiction. □ 

Finally, we introduce a new class of clutters called weakly binary. They can 
be viewed as a generalization of binary and of balanced clutters. (A 0,1 matrix 
is balanced if it does not have A(C^) as a submatrix, fc > 3 odd, where as above 
Cl denotes an odd hole. See [4] for a survey of balanced matrices) . We say that 
a clutter C has an odd hole Cf if A(C^) is a submatrix of A(C). An odd hole Cf 
of C is said to have a non intersecting set if 35 G E{C) such that 5 C U (C^) = 0. 
A clutter is weakly binary if, in C and all its minors, all odd holes have non 
intersecting sets. 

Theorem 6. Let C be weakly binary and minimally non MFMC. Then C is 
ideal. 

Note that, when C is binary, this theorem is an easy consequence of Seymour’s 
theorem saying that a binary clutter has the MFMC property if and only if it 
does not have Qe as a minor [16]. Observe also that Theorem 6 together with 
Conjecture 2, Proposition 2, and Theorem 3, would imply that a weakly binary 
clutter has the MFMC property if and only if it does not contain a minor with 
the Qe property. 
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Appendix 

We construct ideal minimally non packing clutters C with r(C) = 2. By Theo- 
rem 3, these clutters have the Qq property. Thus V(C) can be partitioned into 
Ii, . . . ,Iq and there exist edges Si, ... ,84 in C of the form: 

Si = Ii LS I 3 C I 5 , S 2 = Ii Li I 4 LI Iq, 

S'3 = /2 u A U /s, S 4 = I 2 L I 3 L Iq. 
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Without loss of generality we can reorder the vertices in V{C) so that elements 
in Ik preceed elements in Ip when k < p. 

Given a set V oi p elements, let Up denote the ((2^ — 1) x p) matrix whose 
rows are the characteristic vectors of the nonempty subsets of V, and let be 
its complement, i.e. Up + Up = J. 

For each r,t > 1 let |/i| = I/2I = r, I/3I = IJ4I = t and I/5I = [/el = 1. We 
call Qr,t the clutter corresponding to the matrix 
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where J denotes a matrix filled with ones. The rows are partitioned into four 
sets that we denote respectively by T(3, 5), T(4, 5), T(l, 6), T(2, 6). The indices 
k, I for a given family indicate that the set Ik U// is contained is every element of 
the family. Note that the edge Si occurs in T(3,5), S 2 in T(l,6), S 3 in T(4, 5) 
and Si in T(2, 6). 

Since Hi contains only one row, we have Qi^i = Qq and Q2,i is given by 
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T(3,5) 

T(4,5) 

T(l,6) 

T(2,6) 



Proposition 3. For all r,t > 1, the clutter Qr,t is ideal and minimally non 
packing. 

The clutter V obtained by duplicating element j G V{C) of C is defined by: 
V{V) = V{C)U{f} where / ^ V{C) and E(V) = {S : j ^ S G F1(C)}U{S'U{/} : 
j G S G E{C)}. Let a{k) be the mapping defined by: a(l) = 2, a(2) = 1, a(3) = 
4, a(4) = 3, a(5) = 6, a(6) = 5. 

Suppose that, for k G {!,.., 6}, we have that Ik contains a single element 
j G V(C). Then j belongs to exactly two of ^i, . . . , S'4. These two edges are of 
the form {j} LI H LI It and {j} U Ia(r) U Ia{t)- We can construct a new clutter 
C ® j by duplicating element j in C and including in E(C ® j) the edges: 

{j} U 4(4) LirLit, 



{j } LI Ia(j) L Ia{r) L Ia{t)- 



( 2 ) 
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Since the construction is commutative we denote hy C'^{ki, ... ,ks} the clutter 
(C 0 fci) . . . (g) fcg. For Qq, we have Ii = {1} = S'! n S '2 and {1} U Ia(i) LI I3U I5 = 
{1,2, 3, 5} and finally |l'}U/„(i) U 4(3) U 4(g) = {l',2,4,6}. Thus 
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Proposition 4. Any clutter obtained from Qq and the ® construction is ideal 
and minimally non packing. 

The clutter Qe (1, 3, 5} was found by Schrijver [14] as a counterexample to a 
conjecture of Edmonds and Giles on dijoins. Prior to this work, Qe smd Qe G 
{1,3,5} were the only known ideal mnp clutters. Eleven clutters can be obtained 
using Proposition 4. In fact it can be shown [8] that this proposition remains 
true if we replace by Qr,t- There are also examples that do not fit any of the 
above constructions, as shown by the following ideal mnp clutter. 
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A Characterization of Weakly Bipartite Graphs 



Bertrand Guenin 

Graduate School of Industrial Administration 
Carnegie Mellon University, Pittsburgh, PA 15213, USA 



Abstract. A labeled graph is said to be weakly bipartite if the clutter 
of its odd cycles is ideal. Seymour conjectured that a labeled graph is 
weakly bipartite if and only if it does not contain a minor called an odd 
K^. An ontline of the proof of this conjecture is given in this paper. 



1 Introduction 

Let Q = (y E) be a graph and E C E. Edges in S are called odd and edges in 
E — S are called even. The pair {Q, E) is called a labeled graph. Given a subgraph 
TL of Q, V{H) denotes the set of vertices of H, and E{H) the set of edges of H. 
A subset L C E{Q) is odd (resp. even) if \L n E\ is odd (resp. even). A cycle of 
5 is a connected subgraph of Q with all degrees equal to two. 

A labeled graph {Q, E) is said to be weakly bipartite if the following polyhe- 
dron Q is integral (i.e. all its extreme points are integral): 

Q = e cycles C of {Q, A)| (1) 

See Gerards [7] for a recent survey on weakly bipartite graphs and connexions 
with multicommodity flows. Particularly interesting is the case where E = E{Q). 
Let X be any 0, 1 extreme point of Q. Then x is the incidence vector of a set of 
edges which intersect every odd cycle of tj. In other words e — x is the incidence 
vector of a bipartite subgraph of Q. Let w G be weights for the edges of Q 
and let a; be a solution to 

min { wa; : a; G Q n {0, } . (2) 

Then e — a; is a solution to the Weighted Max-Gut problem. This problem is 
known to be NP-Hard even in the unweighted case [10]. Note, weakly bipartite 
graphs are precisely those graphs for which the integrality constraints in (2) can 
be dropped. 

Weakly bipartite graphs Q with E = E{Q) were introduced by Grotschel and 
Pulleyblank [8]. They showed that the optimization problem min{wa; : x G Q} 
can be solved in polynomial time. 

Barahona [1] proved that planar graphs are weakly bipartite. In fact, Fonlupt, 
Mahjoub and Uhry [5] showed that all graphs which are not contractible to 

R. E. Bixby, E. A. Boyd, and R. Z. Rfos-Mercado (Eds.): IPCO VI 
LNCS 1412, pp. 9-22, 1998. @ Springer-Verlag Berlin Heidelberg 1998 
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are weakly bipartite. This is closely related to an earlier result by Barahona 
[2] on the cut polytope. Note, this does not yield a characterization of weakly 
bipartite graphs. Consider the graph obtained from by replacing one edge by 
two consecutive edges. This graph is weakly bipartite and contractible to K^. 

Following Gerards [6], we will define a number of operations on labeled 
graphs, which maintain the weak bipartite property. Given U C V{Q), the cut 
{{u,v) : u G U,v ^ U} is denoted by S{U). Given two sets Si, S 2 , the symmetric 
difference (S'! US' 2 ) — (S'! nS' 2 ) is denoted by 5i AS' 2 . The labeled graph obtained 
by replacing A by 27 A S{U) in {Q, 27) is called a relabeling of {Q, 27). Since 6 {U) 
intersects every cycle an even number of times we readily obtain that: 

Remark 1. {Q, 27) and {Q, 27 A 5{U)) have the same set of odd cycles. 

{Q, 27) \ e denotes a labeled graph {Q' , 27'), where 27' = 27 — {e} and Q' is 
obtained by removing edge e from Q. {Q, 27) je denotes the labeled graph obtained 
as follows: (1) if e is odd (i.e. e G 27) then find a relabeling of {Q, 27) such that 
e is even, (2) contract edge e in ^ (remove edge e in ^ and identify both of 
its endpoints). (f7, 27) \ e is called a deletion minor and {Q,S)le a contraction 
minor. Let Q be the polyhedron associated with {Q, 27), see (1). It can be readily 
shown (see for example the introduction of [18]) that deleting edge e corresponds 
to projecting Q onto a lower subspace and contracting e corresponds to setting 
Xe to zero. A labeled graph {H, 0) is called a minor of {Q, 27), if it can be obtained 
as a sequence of relabelings, deletions and contractions. It follows from Remark 
1 and the above observations that: 

Remark 2. If {G, 27) is weakly bipartite then so are all its minors. 

{H, 9) is called a proper minor of {Q, 27) if it is a minor of {Q, 27) and \E{H)\ < 
|i7(^)|. An odd K 5 , denoted by K 5 , is the complete graph on 5 vertices where 
all edges are labeled odd. For the polyhedron Q associated with K^, the 10 
constraints corresponding to the triangles (the odd cycles of length three) de- 
fine a fractional point (|,...,i) of Q. Thus K 5 is not weakly bipartite. Sey- 
mour [18], [19] predicted, as part of a more general conjecture on binary clutters 
(see Sec. 2.3) that: 

Conjecture 1. {G, 27) is weakly bipartite if and only if it has no minor. 

A labeled graph is said to be minimally non weakly bipartite if it is not weakly 
bipartite but all its proper minors are. An outline to the proof of the following 
theorem (which is equivalent to Gonjecture 1) is given in this paper. 

Theorem 1. Every minimally non weakly bipartite graph is a relabeling of K^. 

Section 2 introduces some basic notions on clutters, before stating a theorem by 
Lehman [13] on minimally non ideal (mni) clutters. The section concludes by 
deriving key properties of binary mni clutters. Section 3 gives an outline of the 
proof of Theorem 1 . A complete proof of this result can be found in [9] . 
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2 Clutters 

2.1 Elementary Definitions 

A clutter ^ is a pair (E{A), f2{A)) where E{A) is a finite set and i7{A) is a 
family of subsets of E{A), say {S'!, . . . , S'm}, with the property that Si C Sj 
implies Si = Sj. M{A) denotes a 0, 1 matrix whose rows are the incidence 
vectors of the elements of Q{A). A clutter A is said to be ideal if the polyhedron 
Q{A) = {a; > 0 : M{A)x > e} is integral. We say that A is the clutter of odd 
cycles of a labeled graph (G,E) if E{A) = E{G) and the elements of n{A) are 
the odd cycles of (G,E). Thus a labeled graph is weakly bipartite when the 
clutter of its odd cycles is ideal. 

Given a clutter A and i G E{A), the contraction A/i and deletion A\i are 
clutters defined as follows: E{A/i) = E{A \i) = E{A) — {i}, f2{A/i) is the set 
of inclusion-wise minimal elements of {S' — {z} : S G ^{A)} and, I7(^\z) = (S : 
i ^ S G n{A)}. Contractions and deletions can be performed sequentially, and 
the result does not depend on the order. A clutter B obtained from .4 by a set 
of deletions Id and a set of contractions Ic, where Ic C\ Id = ^ is called a minor 
of A and is denoted by .4 \ Id/ Ic- We say that ,8 is a contraction minor (resp. 
deletion minor) if /d = 0 (resp. Jc = 0). The following is well known [13]: 

Remark 3. If A is ideal then so are its minors. 

We saw in Remark 1 that relabeling a labeled graph leaves the clutter of odd 
cycles unchanged. Contractions and deletions on a labeled graph (as defined 
in Sec. 1) are equivalent to the corresponding operations on the clutter of odd 
cycles. 

Remark Let A be the clutter of odd cycles of {G,E). Then 

— .4/e is the clutter of odd cycles of {G, E)/e and 

— .4 \ e is the clutter of odd cycles of (Iz, i7) \ e. 

Given a clutter A, the clutter b{A) is called the blocker of A and is defined 
as follows: E {b{A)) = E{A) and 12 (b{A)) is the set of inclusion-wise minimal 
elements of {U : C DU ^ 0, VC G f2{A)}. It is well known that b{A \ Ic/Id) = 
b{A)/Ic \ Id and that b{b{A)) = A [17]. If A is the clutter of odd cycles of a 
labeled graph (G,E), then the elements of f2 (b{A)) are of the form 6{U) A E, 
where 6{U) is a cut of G [6]. 

2.2 Minimally Non Ideal Clutters 

A clutter A is called minimally non ideal (mni) if it is not ideal but all its proper 
minors are ideal. Because of Remark 4, the clutter of odd cycles of a minimally 
non weakly bipartite labeled graph is mni. In this section we review properties 
of mni clutters. 

The clutter At, for t > 2 integer, is given by E{Jt) = {0, . . . , t} and f2{Jt) = 
{{1, . . . , t}, {0, 1}, {0, 2}, . . . , {0, t}}. The cardinality of the smallest element of 
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Q {b{A)) is denoted by t{A). In this section we consider the matrix representa- 
tion A = M{A) of a clutter A. We say that a matrix M{A) is mni when the 
clutter A is mni. The blocker of b{M{A)) is the matrix M (6(-4)) and t{A) is 
the smallest number of non-zero entries in any row of b {M{A)). 

Given a mni matrix A, let x be any extreme point of Q{A) = {a; > 0 : Ax > e} 
with fractional components. A maximal row submatrix A of A for which Ax = e 
is called a core of A. Two matrices are said to be isomorphic if one can be 
obtained from the other by a sequence of permutations of the rows and columns. 
The next result is by Lehman [13] (see also Padberg [16], Seymour [20]). 

Theorem 2. Let A he a mni matrix. Then B = b{A) is mni. Let r = t{B) and 
s = t{A). Either A is isomorphic to M{Jt) or 

(i) A_(resp. B) has a unique core A (resp. B). 

(ii) A, B are square matrices. 

Moreover, the rows and columns of A can he permuted so that 
(Hi) AB"'" = J + {rs — n)I, where rs — n > 1. 

Here J denotes a square matrix filled with ones and / the identity matrix. Also 
e is the vector of all ones, ej is the unit vector, and Bj denotes column j of 
B. The following is a special case of a result of Bridges and Ryser [3]: 

Theorem 3. Let A, B be matrices satisfying (ii),(iii) of Theorem 2. 

(i) Columns and_rows of A (resp. B) have exactly r (resp. s) ones. 

(ii) AB^_= I^B 

(Hi) A'^ B,j = e + (rs — n)cj. 

(iv) Let j be the index of any column of A. Let Ci, . . . ,0) (resp. U\, . . . ,Ur) 
he the characteristic sets of the rows of A (resp. B) whose indices are given 
by the characteristic set of column j of B (resp. A). Then C\, . . . ,Cs (resp. 
Ui, . . . , Ur) intersect only in {j} and exactly q = rs — n + 1 of these sets 
contain j . 

Note that in the last theorem. Property (ii) implies (iii) that in turn implies 
(iv). Because of Theorem 2 and Theorem 3(i) the fractional point x must be 
( i , . . . , i ) . The next remark follows from the fact that A is a maximal row 
submatrix of A for which Ax = e. 

Remark 5. Rows of A which are not rows of A have at least r -|- 1 non-zero 
entries. Similarly, rows of B which are not in B have at least s -I- 1 ones. 

Let A A fTi be a mni clutter with A = M(A). A, with M(A) = A, denotes the 
core of A. Let B be the blocker of A and B the core of B. Consider the element 
C G 0(A) (resp. U G L2(B)) which corresponds to the row of A (resp. B). 
By Theorem 2 (iii), C intersects every element of L2(B) exactly once except for 
U which is intersected q = rs — n + 1 > 2 times. We call U the mate of C. 
Thus every element of f2(A) is paired with an element of f2(B). Notice that 
Theorem 3(iv) implies the following result. 

Remark 6. Let A be a mni clutter distinct from Jt and consider C\,Ci G 17(A) 
with i G Cl n C 2 . The mates Ci, C /2 of Ci, C 2 satisfy Ci n C 2 C {i}. 
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2.3 Binary Clutters 

A clutter A is said to be binary if for any three sets Si, S 2 , S 3 € ^^(A), the set 
S'! AS '2 AS '3 contains a set of J7(A). Lehman [11] showed (see also Seymour [17]): 



Theorem 4. A is binary if and only if for any C G n{A) and U & fl {b{A)) we 
have \C C\U\ odd. 

Thus in particular if A is binary then so is its blocker. The following is easy, see 
for example [ 6 ]. 

Proposition 1. Let {Q, E) be a labeled graph. Then the clutter of odd cycles of 
{Q, E) is binary. 



2.4 Minimally Non Ideal Binary Clutters 

Note that the blocker of Jt is Jt itself. We therefore have {1,2} G E{Jt) and 
{1,2} G E {b{Jt)). It follows by Theorem 4 that Jt is not binary. The clutter T-j 
is defined as follows: E{T-j) = {1, . . . , 7} and 

= {{1, 3, 5}, {1, 4, 6}, {2, 4, 5}, {2, 3, 6}, {1, 2, 7}, {3, 4, 7}, {5, 6, 7}} . 

The clutter of odd cycles of K 3 is denoted by Ok^ ■ Conjecture 1 is part of a more 
general conjecture by Seymour on minimally non ideal binary clutters. See [18] 
p. 200 and [19] (9.2), (11.2). 

Conjecture 2. If .4 is a minimally non ideal binary clutter, then A is either Tr, 
Ok, or b{OK,). 

Since we can readily check that T-j and b{OK,) are not clutters of odd cycles this 
conjecture implies Conjecture 1. Next are two results on mni binary clutters. 

Proposition 2. Let A be a mni binary clutter and C\,C 2 G C{A). If C C 
Cl U C 2 and C G 12(A) then either C = C\ or C = C 2 . 

Proof. Let r denote the cardinality of the elements of 17(.4). 

Case 1: jCj = r. 

By Remark 5, we have C G 17(.4). Let U be the mate of C and q = \CnU\ > 
2. By Theorem 4, q is odd so in particular q > 3. Since C C Ci U C 2 , we 
must have ][/ n Ci] > 1 or jC/n C 2 I > 1. This implies that U is the mate of 
Cl or C 2 , i.e. that C = Ci or C = C 2 . 

Case 2: jCj > r. 

Let t = ]Ci n C 2 n C[. Since C C Ci U C 2 , it follows that 



lci = t+l(CiAC2)nc]. 



(3) 
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For T = Cl A C 2 A C, we have 

|T| = |(Ci n C2 n c) u [(Cl A C2) - c]| , c c Ci u C2 

= |Ci n C 2 n c| + |Ci A C 2 I - |(Ci A C 2 ) n c| 

= t+|CiAC 2 |-(|C|-i), by (3) 

= 2t+|Ci| + |C2|-2|CinC2|-|C| 

< IC 1 I + IC 2 I-ICI, i<|CinC 2 | 

< 2 r-(r+l), Ci,C 2 GC(^) 

Since A is binary we have that T is equal to, or contains an element of 
But |T| < r — 1 which contradicts Theorem 3(i) and Remark 5. □ 

Notice that for Ok^ the previous theorem simply says that given two triangles 
Cl, C 2 there is no odd cycle (distinct from Ci and C 2 ) which is contained in the 
union of Ci and C 2 . It is worth mentioning that this is a property of mni binary 
clutters only. Indeed the property does not hold for odd holes or more generally 
for any circulant matrix with the consecutive one property. For a description of 
many classes of mni clutters see [4] and [14]. 

Proposition 3. Let A be a mni binary clutter and B its blocker. For any e G 
E{A) there exist Ci,C 2 ,Cs G L2{A) and Ui,U 2 ,U^ G fI{B) such that 

(i) Cl n C 2 = Cl n C 3 = C 2 n C 3 = {e} 

(n) Cl n C 2 = Cl n c/3 = C 2 n C 3 = {e} 

(Hi) For all i,j G {1,2,3} we have: 

Ci n Uj = (ej if i ^ j, and [Ci n C^ | = g > 3, if i = j. 

(iv) For all Ci G Ui and ej G Uj with i,j G {1,2,3} 

3C G f2{A) with C n Ci = {ci} and C DUj = {e^}. 

Proof. Let r (resp. s) denote the cardinality of the elements of C{A) (resp. 

n{B)). 

(i) By Theorem 3(iv) there exist s sets Ci,...,Cs G i7{A) such that Ci — 
{e}, . . . , Cg — {e} are all disjoint. Moreover, exactly q = rs — n+l>2of 
these sets, say Ci, . . . , Cq, contain e. Finally, by Theorem 4 g > 3. 

(ii) Let Ci be the mate of Ci, where i G {1,2,3}. We know [Ci n Ci| > 1 and 
for all j G {1, . . . , s} — {i} we have [Ci F Cj\ = 1. Since Ci, . . . ,Cs only 
intersect in e and since by Theorem 3(i), |Ci| = s it follows by counting that 
e G Ci. Finally, by Remark 6 and the fact that e G Ci n C 2 n C 3 , we obtain 
Ci n Uj C {e} for all t yf j and i,j G {1, 2, 3}. 

(iii) Follows from (i),(ii) and the fact that Ci is the mate of Cj if and only if 
i=j. 

(iv) Let T = UiUUj — {ci, Cj}. Since A is binary so is its blocker B. By Propo- 
sition 2, there is no C G f2{B) with U C T. Thus E{A) — T intersects every 
element of f2{B). Since the blocker of the blocker is the original clutter, it 
follows that E{A) — T contains or is equal to, an element C of Q{A). Since 
C n Ci yf 0 and C FUj yf 0 we have by construction C n Ci = {ci} and 
cnc, = {e,}. 
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3 Outline of the Proof 

3.1 From Binary Clutters to Labeled Graphs 

Let {Q, S) be a minimally non weakly bipartite graph. Let A be the clutter of 
odd cycles of {G, A!) and B its blocker. As noted in Sec. 2.2, .4 (and thus B) is 
mni. Let e be any edge of E{A) and let Ui,U 2 , U 3 G il{B) be the sets defined in 
Proposition 3. We define 

R=Ui-{e} B = U 2 -{e} G = U 3 - {e} (4) 

and 

W = E(A) - (Cl U C 2 U C 3 ) U {e}. (5) 

Note that by Proposition 3(ii), R, B, G and W form a partition of the edges of 
G- It may be helpful to think of R, B,G,W as a coloring of the edges of G- 
Let C be any odd cycle of {G, E) with e ^ G. Since C G n{A) we have 
C n Ci yf 0 for z G {1,2,3} it follows that Cni?y^0,Cni3y^0,CnGy^0. 
Therefore, the minimally non weakly bipartite graph (G, E) satisfies the following 
property: 

(PI) Every odd cycle C of (G,E) that does not contain e, has at least one 
edge in R, one in B, and one in G. 

Consider edges Ci G R and Cj G B then a G Ui,ej G C 2 . Hence by Proposi- 
tion 3(iv), there is an odd cycle G G fi{A) with {d} = G DUi E C Cl R and 
{cjl = C n U 2 E G n B. Therefore, (G, E) also satisfies: 

(P2) For any a G R (resp. B, G) and ej G B (resp. G, R) there is an odd 
cycle G of (G, E) with the following properties. G does not contain e, 
the only edge of G in i? (resp. B,G) is Cj, and the only edge of G in H 
(resp. G, R) is Cj. 

3.2 Building Blocks 

Definition 1. We say that a sequence of paths S = [Pi, . . . ,Pt] forms a circuit 
if each path Pi has endpoints Vi,Vi+i and Vt+i = v\. We denote the set of edges 
EUR byE{S). 

The next result is easy. 

Lemma 1. Let S = [Pi,...,Pt] he a sequence of paths that form a circuit. If 
there is an odd number of odd paths in S then E{S) contains an odd cycle. 

Given two vertices vi and V 2 of path P, the subpath of P between vi and V 2 is 
denoted by P{vi,V 2 ) = P{v 2 ,vi). Given a set S' C P and e G P, we denote the 
set S — (ej by S—e. Let G be a cycle, vi,V 2 two vertices of G and e an edge 
of G. Using the notation just defined we have that G— e(ui, U 2 ) defines a unique 
path. Next comes the first building block of the odd P5. 
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Lemma 2. Let {Q, E) be a minimally non weakly bipartite graph with a partition 
of its edges as given in (4)-(5). 

(i) There exist odd cycles Cr Q RUW,Cb Q BUW and Cq C GU VL which 
intersect exactly in e. Moreover, \Cr n R\, \Cr H B\, \Ca n G\ are all even 
and non- zero. 

(ii) Cr,Cb and Cq have only vertices w\ and W 2 in common, where e = 
{WI,W2). 




Fig. 1. Lemma 2. Bold solid lines represent paths in U W—e, dashed lines paths in 
B U W—e, and thin solid lines paths in G U W—e. 



Proof (of Lemma 2). 

(i) Let us rename sets C\,C 2 ,Cz of Proposition 3 by Cr,Cb,Cg. We know 
from Proposition 3(i) that Cr, Cr and Cq intersect exactly in e. By Propo- 
sition 3(iii), 0 = Gfi n (C /2 U C /3 - {e}) = n (B U G). Thus CrCRUW. 
Also by Proposition 3(iii), |G/?ni?| = |Gfln(C/i — {e})| = \CrC\Ui\- 1 = q—1 
where q > 3 is odd. Identically we show, Cr C B LI W, Cq L GLW and 
\Gb n B\, \Gg n G| both non-zero and even. 

(ii) Suppose, for instance, Gr and Gr have a vertex t distinct from and W 2 

in common. Let P = GR—e{wi,t),P' = GR—e{w 2 ,t) and Q = CR—e{wi,f), 
Q' = CR—e{w 2 ,t), see Fig. 2. Since we can relabel edges in (5({wi}) and 
in <5({/}) we can assume w.l.o.g. that edge e = (wi,W 2 ) is odd and that 
paths P, P' are both even. If Q is odd then let S = [P, Q] otherwise let 
S = \{e],Q,P']. By Lemma 1, E{S) contains an odd cycle, a contradiction 
to Proposition 2. □ 

Since Gr, Cr and Gq have only vertices wi and W 2 in common, we can relabel 
{Q, E) so that e is the only odd edge in Gr U Cr U Gq. Let us now proceed to 
add 3 more paths Pr, Pr and Pq to our initial building block. 




A Characterization of Weakly Bipartite Graphs 



17 




Fig. 2. Lemma 2(ii). 



Lemma 3. Let {Q, E) be a minimally non weakly bipartite graph with a partition 
of its edges as given in (4)-(5). Suppose we also have odd cycles Cr,Cb and Cq 
as defined in Lemma 2 where e is the only odd edge in Cr U Cb U Cq- 

(i) There is an odd path Pr (resp. Pb,Pg) between a vertex vbr (resp. 
vrBiVbg) of Cb (resp. Cr,Cb) distinct from W\,W 2 , and a vertex vgr 
(resp. vgb,vrg) of Cg (resp. Cg,Cr) distinct from wi,W 2 . 

(a) Pr C i? U W—e, Pb C B LI W—e and Pg C G U W—e. 




Fig. 3. Lemma 3. 



Proof (of Lemma 3). By symmetry it is sufficient to show the result for path 
Pr. Since \Cr n i?| > 2 there is an edge cr = {vbr, "o'br) G B of Cb such that 
Vbr is distinct from w\,W 2 and CB—e{wi,VBR.) contains exactly one edge in B 
namely cr. Similarly, we have edge cg = {vgr,Vqr) G C'g with vgr distinct 
from , W2 and Gq— e(wi , vgr) H G = {cg}- 

By property (P2) there is an odd cycle G such that C C\ B = {cr} and 
Gn G = {cg}. The cycle G can be written as {cr, bg} L PrL Pf^ where Pr and 
P'j^ are paths included in i? U W—e. Since G is odd we can assume w.l.o.g. that 
Pr is odd and is even. 
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Case 1: The endpoints of Pr are v'^^^vgr (resp. vbrjVqj^). 

Then let S = [Cb— e(wi, Pr, Ca—eivcR, wi)]- By Lemma 1, E{S) con- 
tains an odd cycle but e ^ E{S) and E{S) n B = 0, a contradiction to 
(PI). 

Case 2: The endpoints of Pr are 

Then let S = [Cb— e(wi, Pb, Cq— tci)]. By Lemma 1, E{S) con- 
tains an odd cycle but e ^ E{S) and E{S) H B = E{S) n G = 0, a contra- 
diction to (PI). 

Thus Pr has endpoints vbr,vgr- □ 

An internal vertex of a path P is a vertex of P which is distinct from the end- 
points of P. By choosing paths Pr, Pr, Pg carefully we can show the following 
additional property: 

Lemma 4. Pr, Pr and Pg have no internal vertices in common with Cr, Cr 
or Cg- 

Remark 7. Let (S, E) be a labeled graph with an odd path P where all internal 
vertices of P have degree two (in G). Then there is a sequence of relabeling 
and contractions that will replace P by a single odd edge, without changing the 
remainder of the graph. 



Remark 8. Consider (Q, E) as defined in Lemma 3. If Pr, Pr, Pg have no inter- 
nal vertices in common, then (Q, E) contains a minor. 

This is because in this case the following sequence of operations yields K^: 

1. delete all edges which are not in Cr, Cr, Cg and not in Pr, Pr, Pg, 

2. contract CR—e{vRG,VRB),CB—e{vBR,VRG),CG—e{vGR,VGB), 

3. relabel edges in ddrui, W 2 }), 

4. replace each odd path by a single odd edge (see Remark 7). 

3.3 Intersections of Paths Pr, Pr, and Pg- 

Because of Remark 8 we can assume at least two of the paths Pr, Pr and Pg 
must share an internal vertex. One of the main step of the proof of Theorem 
1 is to show the following lemma (we give a simplified statement here) which 
describes how paths Pr,Pr and Pg must intersect. 

Lemma 5. A minimally non weakly bipartite graph {Q, E) is either a relabeling 
of ATs or it contains a contraction minor {H, E-r) with the following properties 
(see Fig. 4 )’ 

(i) There are odd paths Pr, Pr, Pq 0 / ^^6 form given in lemmas 3, 4 but 
with vertices v'rg,'^'gb instead of vrg,vgb ■ 

(n) P'RQR,P'gCB,P(.CG. 
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(Hi) P'j^ and Pg (resp. Pq) share an internal vertex tus (resp. tua)- 

(iv) P'BitRBjVcB) Pq share an internal vertex tBC- 

(v) Pg and PQ^tuG^Vgo) share an internal vertex t'gQ. 

(vi) Paths Pg{vBRitRB) and PgivGRitRG) consist of a single edge. 

(vii) Pg (resp. Pq) has exactly one odd edge which is incident to VQg 
(resp. v'ga). 

(via) PB{vBR,tRB),PgivGR,tRG) are even and PR(tRB,tRG) is odd. 
(ix) No vertex is common to all three paths Pr,Pb and Pg. 




Fig. 4. Lemma 5. Graph {T~L, Eg). 



We did not represent vertices Irg (and t'gQ) in the previous figure. Let q denote 
the first vertex of Pq, starting from vrg, which is also a vertex of P'b{Irb,v'qb) 
(see Fig. 4). By Lemma 5(iv) there exist such a vertex. Let q' be the first vertex 
of P'Q{q,VRG), starting from q, which is either a vertex of Pg or equal to vrg. 
By Lemma 5(ix) q, q' are distinct from Irr. 

Definition 2. (IC,E]c) is the graph (see Fig. 5) obtained by deleting every edge 
of [71, Efi) which is not an edge of Cr,Cb,Cg or PR{vBR,tRB), Pr and P'Q{q,q'). 

From Lemma 5 we can readily obtain the following properties: 

Remark 9. 

(i) There are exactly two odd edges in (/C, E^), namely e and the edge of P'r 
incident to v'qr. 

(ii) Let S be the set of vertices {wi,W 2 ,VBR,VRB,v'QR,tRR,q,q'} shown in 
Fig. 5. vrb and q' may denote the same vertex but all other vertices of S 
are distinct. 

(iii) S is the set of all vertices of (/C, Ex) which have degree greater than two. 
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Fig. 5. Graph {K., Ek), Left q' = vrg, right q' 7^ vrg 



Definition 3. Let Irb be the vertex of {H, E-h) defined in Lemma 5(iii). e = 
{tRBjt'iiB) denotes the edge of P'B{tRB,VRB) incident to tRB (see Fig 5). Note, 
tfig need not he distinct from vrb ■ 



Lemma 6 . Let {Ti.,En) be the graph defined in Lemma 5. There are three dis- 
tinct odd paths Fi, F 2 , F 3 from tRB to tfi^. 

Proof. By applying Proposition 3(i) to edge e we obtain odd cycles Li,L 2 ,L^ 
of {G,E) which intersect exactly in e. Let L[,L' 2 ,L'^ be the corresponding odd 
cycles in {H, En) and Fi, for z G {1,2, 3}, denotes the path L'i—e. Since e is even 
and L' is odd we must have Fi odd as well. □ 



Lemma 7. Let {1C, Ejc) be the graph given in Definition 2 and let F\,F 2 ,F^ 
he the paths given in Lemma 6. Then F\,F 2 ,F^ all have an internal vertex in 
common with {IC,Ek:). 

Proof. Suppose for a contradiction this is not the case and we have Fi with 
no internal vertices in common with {IC,Ejc), see Fig. 6. Consider the graph 
obtained from {H, E-r) by deleting e = (tRB,t'RB) edges which are not 

edges of (/C, Ejc) or edges of Fi. The following sequence of operations yields K^: 

1. relabel edges in Jdg}), 

2. if q' = Vrg then contract CR—e{vRG,VRB) otherwise contract P'b{<i' ,vrb), 

3. relabel edges in Jdg}), 5({rci, W 2 }), 

4. contract P'b{^i'v'gb) CB—e{t,VBR), 

5. replace odd paths by odd edges, see Remark 7. 
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Fig. 6. Lemma 7. We represent the case where q' = vrg only. 



Hence {Q, S) contains as a proper minor, a contradiction since {Q, S) is 
minimally non weakly bipartite. □ 

Let {H, Sfi) be the graph obtained by deleting from (7f, Sn) all the edges 
which are not edges of Cr, Cb,Cg and not edges of P'^ and Pq. Because of 
Lemma 7 we can define fi, for i = 1,2,3, to be the first internal vertex of Fi, 
starting from tRs, which is also a vertex of (7Y, Sr). By symmetry (see Fig. 4) 
there is a vertex a vertex t'^Q of PQ{tRG, vrg) which is incident to Irg and there 
are odd paths F[, ^3 between Irg and As previously we can define /', 

for i = 1,2,3, to be the first internal vertex of P/, starting from Irg, which is 
also a vertex of (7Y, ^n)- 

The remainder of the proof is a case analysis which shows that for each 
possible set of vertices fi,f 2 ,fs and /(,/ 2,/3 the graph {H,Sr) contains a 
PTs minor. In order to prove Lemma 5 and to make the case analysis tractable 
we first establish general results for labeled graphs with properties (PI) and (P2). 



Acknowledgments. I am most indebted to Prof. Gerard Cornuejols and Prof. 
Frangois Margot for their help. 
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Abstract. We investigate the solution set of the following matrix equa- 
tion: B — J + diag{d), where A and B are n x n {0, 1} matrices, J is 

the matrix with all entries equal one and d is a full support vector. We 
prove that in some special cases (such as: both Ad~^ and have 

full supports, where d~^ = • • • i both A and B have constant 

column sums; d~^ • 1 —1, and A has constant row sum etc.) these so- 

lutions have strong structural properties. We show how the results relate 
to design theory, and then apply the results to derive sharper charac- 
terizations of (a, a;)-graphs. We also deduce consequences for ’’minimal” 
polyhedra with {0, 1} vertices having non-{0, 1} constraints, and ’’mini- 
mal” polyhedra with {0, 1} constraints having non-{0, 1} vertices. 



1 Introduction 



Suppose we are given two n x n {0,1} matrices A and B, and a full support 
vector d. Let us call the pair of matrices (A,B) a {bipartite) d- design if 

B = J + diag{d), 



where J is the matrix filled with ones. It seems difficult to say anything about 
the structure of the matrices A and B in such a general setting. But if d > 0 
then a surprising result of Lehman [7] asserts that either 



A = B^ DPP = 



0 1 
1 I 



(then we call the pair {A,B) a DPB-design), or for some r and s: 

AJ = JA = rJ] BJ = JB = sJ; A^ B = BA^ = J + {rs - n)I 



(then we call the pair {A, B) an (r, s)-design). This result generalizes the earlier 
results of de Bruijn and Erdos [2] and Ryser [12], and it is one of the main 
arguments in the proof of Lehman’s theorem on minimally non-ideal polyhedra 
[ 8 ]. 

In this paper we would like to investigate the d-designs a bit more generally. 
Our main goal is to find sufficient conditions which force a d-design to become 
an (r, s) -design. The following theorem summarizes our results in that direction: 



R. E. Bixby, E. A. Boyd, and R. Z. Rfos-Mercado (Eds.): IPCO VI 
LNCS 1412, pp. 23-36, 1998. © Springer-Verlag Berlin Heidelberg 1998 
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Theorem 1. Let {A, B) he a d-design. Then: 

1. If n > 3 and both and are full support vectors, then {A, B) 

either is an (r, s) -design or a DPP-design; 

2. If AJ = rJ and d“^ • 1 ^ —1, then {A, B) either is an (r, s)-design or r = 2 
(and we characterize all the possible such designs); 

3. If A and B have constant column sums, then {A,B) is an {r, s)-design. 

Our proof (see Section 7) uses widely the ideas of Lehman [8], Padberg [10], 
Seymour [15] and Sebd [14]. In Sections 4 and 5 we state and prove the two 
main ingredients of the proof: de Bruijn and Erdos Lemma [2] and a lemma 
from matrix theory, which contains, as a special case, our key argument from 
linear algebra due to Ryser [12], Lehman [7] and Padberg [10]. In Section 6 we 
discuss the applications of the lemma to the classical design theory. In Section 
8 we apply Theorem 1 to get some new characterizations for (a, w)-graphs. In 
Section 9 we use Theorem 1 to characterize fractional vertices and facets of 
certain types of minimally non-{0, 1} polyhedra. Our characterizations imply 
Lehman’s theorem on minimally non-ideal polyhedra [8], Padberg’s theorem on 
minimally imperfect poly topes [11] and the part of Sebd [14], which correspond 
to nonsingular matrix equations. Matrix generalization of the singular case is 
going to be considered in the forthcoming paper [5] . In Sections 2 and 3 we give 
some preliminaries and discuss some basic examples. 

2 Notations and Terminology 

The following conventions and terminology is used throughout this article. We 
use lower case boldface letters to name vectors and upper case letters to name 
matrices. In particular, 1 and 0 denote vector of all one and the zero vector, 
respectively, and / and J denote the identity matrix and the all one matrix of 
suitable size, denotes the zth unite vector and denotes the (z, j)th element 
of I. For a vector x, Xi is its zth coordinate. Similarly, if A is a matrix then 
Oij denotes its (z,j)th element, a^. denotes its zth row and a j denotes its jth 
column. Also, diag{d) denotes the diagonal matrix made up of the vector d, and 

d~^ = (df\...,d-T- ^ 

We will not distinguish the matrix from the ordered set of its columns. A 
matrix with {0,1} entries we call a {0, 1} matrix and a linear constraint we call 
a {0, 1} constraint if it has {0,1} coefficients and {0,1} RHS. A = B means that 
A can be obtained from B after permutation of its columns and rows. 

If X = {xi, . . . ,Xn), then x/z := (xi, . . . ,Xi-i,Xi+i, . . . ,x„) and if A is a 
{0,1} matrix, then 

A/i := {a,j/i : for each j such that a^- = 0}. 

The dot product of column (row) vectors a and b is a • b := a^b (a • b := ab^). 

If A is a set system, then A denotes its (point-set) incidence matrix (and vice 
versa), and denotes its dual set system, i.e. the (point -set) incidence matrix 
of A^ is congruent to A^. 
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A pairwize balanced design (with index A) is a pair (V, B) where is a set 
(the point set) and B is a, family of subsets of V ( the block set) such that any 
pair of points is in precisely A blocks. A square design is a pairwize balanced 
design with equal number of points and blocks. A symmetric design is a square 
design with blocks of constant size. A projective plane is a symmetric design 
with index 1. If 






0 1 
1 I 



= DPP, 



then A is called a degenerated projective plane. 

Let A be an n X n {0, 1} matrix. G(A) denotes a graph, vertices of which are 
the columns of A and two vertices are adjacent iff the dot product of the corre- 
sponding columns is not zero. We say that A is connected if G(A) is connected. 

If X and are sets of indices, then A\X\J\ denotes the submatrix of A 
induced by the rows X and columns J . 



Definition 1. We say that A is r -uniform (r -regular) if JA = rJ{AJ = rJ). 
We say that A is totally r-regular if JA = AJ = rJ. 



Definition 2. We say that annxn {0, 1} matrix A is a DE-matrix (de Bruijn- 
Erdds’s matrix) if for each pair i and j with Uij = 0, the number of zeros in the 
ith row is equal to the number of zeros in the jth column. 

Definition 3. Let (A,B) be a d-design. + ^7^ = 0 then we call such a 

design singular. The ith row of A (B) we call a d-row z/ai.d~^ = 0 (b^.d”^ = 0). 

Let P be a polyhedron. We say that P is vertex {0, 1} if all its vertices are 
{0, 1} vectors. We say that P is facet {0, 1} if it can be given with the help 
of {0,1} constraints. We call P a {0 , 1} -polyhedron if it is both vertex {0,1} 
and facet {0, 1}. P/j denotes the orthogonal projection of P on the hyperplane 
Xj =0, and P\j denotes the intersection of P with the hyperplane Xj = 0. The 
first operation is called contraction and the second one deletion of the coordinate 
j. A polyhedron P' is called a minor of P if it can be obtained from P by 
successively deletion or contraction one or several coordinates. 

For an m X n {0,1} matrix A, we denote by P<{A) = {x G P" : Ax < 
l;x > 0} the set packing polytope (SP-polytope) and by P>(A) = {x G P" : 
Ax > 1; X > 0} the set covering polyhedron (SC-polyhedron) associated with A. 

A {0, 1} SP-polytope is called perfect, and a {0, 1} SC-polyhedron is called 
ideal. A SP-polytope P is called minimally imperfect if it is not perfect, but all 
its minors are perfect. A minimally non-ideal polyhedron is defined similarly. 

It is easy to see that P>(DPP) is a minimally non-ideal polyhedron. 

For more information on polyhedral combinatorics we refer to Schrijver [13]. 

If G = (y, P) is a graph, then n = n(G) denotes the number of vertices of G; 
ui = w(G) denotes the cardinality of a maximum clique of G; a = a(G) denotes 
the cardinality of a maximum stable set; and y = y(G) denotes the chromatic 
number of G. A fc-clique or fc-stable set will mean a clique or stable set of size 
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k. A graph G is called perfect if x(H) = ui{H) for every induced subgraph H of 
G. A graph G is called minimal imperfect if it is not perfect, but all its proper 
induced subgraphs are perfect. G is called an {a,uj)-graph (or partitionahle), if 
n = q;u;+ 1, and V {G)\v can be partitioned both into w-cliques and into a-stable 
sets, for every v G V (G) (here we assume that the empty graph and the clique 
are (a, w)-graphs). Lovasz [9] proved the following important theorem: 

Theorem 2. If G is minimal imperfect, then it is an {a, co)- graph. 

Theorem 2 provides the only known coNP characterization of perfectness, 
and it is used by Padberg[ll] to show further properties of minimal imperfect 
graphs. For more on (a,w)-graphs we refer to Chvatal et al. [3]. 



3 Constructions Associated with Bipartite d-Designs: 
Some Basic Examples 

There are several ways to associate a combinatorial structure to a d-design {A, B) . 
A straightforward way to do it is to take two set systems A = {Ai , . . . , A„} and 
B = {Bi, . . . , Bn} on some ground set V = {fi, . . . , u„} such that the matri- 
ces A and B are (point-set) incidence matrices of A and B, respectively. Then 
the pair of set systems {A,B) have the following property: for each I < i,j < 
n, \AiC\ Bj\ = 1 + Cijdi. We call such a pair of set systems a d-design. In par- 
ticular, it was proved by Padberg [11] (see also [3]) that the pair of set systems 
of w-cliques and a-stable sets of an (a, w)-graph is an (a, w)-design. Another 
interesting special case is when A = B. It was proved by de Bruijn and Erdos 
[2] that {A, A) is a d-design iff A^ is a (may by degenerated) projective plane. 

A d-design can be characterized with the help of just one set system 
Then the ith column of B can be interpreted as an incidence vector of a set 
subsystem of which contains all the points except Ai by exactly once and Ai 
by exactly di + 1 times. We call such a set system a d-hypergraph. A d-hypergraph 
corresponding to a (r, s)-design we call an (r, s) -hypergraph. In particular, — 1- 
hypergraph is a hypergraph having equal number of edges and vertices such 
that for each vertex v, V\v can be partitioned with the help of its edges. We 
will show that a —1-hypergraph is an (a, w)-hypergraph, which corresponds to 
the w-clique hypergraph of some (a, w)-graph. 

An interesting special case is when is 2-uniform, i.e. it is a graph. A 
nonsingular d-design {A,B) we call a G-design, if A is 2-regular. A graph G we 
call a d-graph if there exists a G-design (A,B) such that A is the (edge- vertex) 
incidence matrix of G. If G is an odd cycle, then we call (A, B) a G-design. 

Let G be a d-graph. Then it is not difficult to show that, for each 1 < i < n, 
di= ±1 (see Lemma 9). Denote by G\v {G/v) the graph obtained from G after 
deleting (duplicating) the vertex v. It is easy to see that, for each vertex v, either 
G\v or G/v has a perfect matching. Call such a graph matchable. The following 
lemma characterizes d-graphs: 
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Lemma 1 . G is a d-graph iff it is a connected graph with odd number of vertices 
and exactly one odd cycle such that the distance from each not degree two vertex 
to the cycle is even. 

Proof. We will prove by induction on the number of vertices. Suppose the theo- 
rem is true for the graphs having less than n vertices and G is a d -graph with n 
vertices. Then, clearly, G is connected, has equal number of edges and vertices, 
and odd number of vertices. Hence G has exactly one cycle, which is odd, as the 
(edge-vertex) incidence matrix of G is nonsingular. Furthermore, if v\ is a leaf of 
G, and V\V2 G E{G), then G\{ui;u2} is matchable. Indeed, for each v yf ui,f2, 
the perfect matching of G\v (or G/v) must contain the edge V\V2. Hence after 
deleting the edge viV2 from the matching, it will be a perfect matching for G\v 
(or G/v). It follows that either V2 has degree 2, or it is a vertex of the cycle and 
has degree 3. 

Let U3 yf v\ such that U3U2 G E. Now if V2 has degree two then G\{ui; V2} is a 
d-graph. Hence by induction hypothesis, the distance from each not degree two 
vertex u yf ui; U3 to the cycle is even. If vi is the unique leaf of G nonadjacent 
to the cycle, then the degree of U3 in G\{ui; V2} is one, hence the distances from 
vi and V3 to the cycle are also even. If V4 yf is a leaf nonadjacent to the cycle 
and V4V5 G E then, by induction hypothesis, G\{t>4;u5} and G\{ui; U2; U4; U5} 
are a d-graphs, hence the distances from vi and V3 to the cycle are again even. 

If G has no leafs then it is an odd cycle and we have nothing to prove. Suppose 
G has leafs, but all of them are adjacent to the cycle. Then it is easy to see that G 
has exactly two leafs, the neighbors of which are adjacent vertices in the cycle. 
Denote by Vi the set of vertices of G such that G\v has a perfect matching 
and by V2 = Then it is easy to see that G/v has a perfect matching iff 

u G V2 and I Hi I = IV2I + 1- Now if (A,B) is a d-design corresponding to G, then 
d~^ • 1 =|Vi| — IV2I = —1, a contradiction. 

The sufficiency of the condition is proved by similar arguments. 

It is possible to associate a full dimensional simplex to a nonsingular d-design 
as follows. Denote by y the unique solution of B^y = 1, and by P{A,B) = 
convex hull {a.i, . . . , a.„, y}. The facets of P{A, B) containing the vertex y are 
h j • X = 1. Hence P{A,B) has at most one non-{0, 1} facet, and at most one 
non-{0, 1} vertex, which is not in that facet. 

Conversely, suppose y is a non-degenerated vertex of a polyhedron P such 
that all its neighbors and the facets containing it are {0, 1}. Then we can asso- 
ciate a d-design (H, B) with y taking as columns of A the neighboring vertices 
of y and as columns of B the supports of the facets of P containing y such that 
the vertex corresponding to the zth column of A is not in the facet corresponding 
to the zth column of B. We call such a vertex a d-design vertex. Similarly, if E 
is a simplicial facet such that all its vertices and neighboring facets are {0, 1}, 
then we associate a d-design with F and call it a d-design facet. 
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4 De Bruijn-Erdos’s Matrices 



In this section we summarize the information about DE-matrices, which we use 
in this paper. 

Lemma 2 ([2]). Let be (iTi Ti X Tn {0, 1} Tfiatvtx wtthout all otic coIuuiits. If 
n < m and for each pair i and j with Oij = 0, the number of zeros in the ith 
row is less or equal than the number of zeros in the jth column, then A is a 
DE-matrix. 



Proof. Denote by Wj = l/(n — 1 • a.^) and w = {wi, . . . ,WmY' ■ Then 



(l^-a,.)w= 



n — 1 • a 






m — ai 1 



= 1 , 



aij—Q 



aij—Q 



and m = 1^(J— 4)w <n. Hence we should have equality throughout, i.e. m = n, 
and if Oy = 0 then a^, 1 = 1 • a. j . 

Lemma 3. If A is a DE-matrix, then it has an equal number of all one rows 
and columns. 

Proof. Delete all one rows and columns and apply Lemma 2. 



Lemma 4. Let A and A' be DE-matrices, where a.iyfa' i and a j = afj, 1 < 
j < n. Then either A or A! has an all one column. 

Proof. Indeed, if say an = I and = 0, then a^. = 1^. Hence by Lemma 3, A 
has also an all 1 column. 



Lemma 5. If B is a DE-matrix and J—B is connected, then B is totally regular. 

Proof. If (1 — b,i) • (I — b.j) yf 0, then there exists an index k such that bki = 
bkj = 0, hence 1 • h j = b^.l = 1 • b.^. As J — B is connected, it follows that 
JB = sJ, for some integer s. Since B has no all 1 column, and it is a DE-matrix, 
it cannot have an all 1 row. Hence B is totally s-regular. 

The following result, which has been extracted by Sebd [14] from Lehman [8] 
(see also [15] and [10]), is one of our key arguments in the proof of Theorem 13. 
Denote by A* the set of all the solutions of A^x = 1, and by A^^ the set of all 
{0, 1} vectors in A*. 

Theorem 3 ([8,14]). Let A be a nonsingular {0, 1} matrix without an all one 
column. If the vector x G A* has full support, and for each i. 



then A is a DE-matrix. 



x/t G linear hull {A/if^ , 
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Proof. Let Bi C [A/i)^^ be a matrix such that the equation Biy = x/t has a 
unique solution. As x/i has full support, Bi has no all zero row. 

Suppose Oij = Q, L = {I : ay = 1} and B[ is the submatrix of Bi induced by 
the rows L. Then all the columns of S' have exactly one 1. Thus we have: 

a^.l =n — rk(A/t) > rk(Si) = l^S'l > 1 • a.j, 

and we are done by Lemma 2. 

The next important result of Lehman [8] will be used to prove Theorem 12. 

Theorem 4 ([8]). Let A he (in n x m {0, 1} Tfiotvix hcLVtnfj full row vcink wnd 
no all one or all zero or equal columns. If the vector x G A* has full support, 
and for each i, affine hull (A/i) can be given with the help of {0, 1} constraints, 
then A is a DE-matrix. 

Proof. Suppose A! = {a.i, . . . ,a.„} is nonsingular. Then for each 1 < z < n : 

x/z G (A/z)* C linear hull (A/z)°^ C linear hull {A /z)°^, 

hence by Theorem 3, A! is a DE-matrix. 

If zz < TO, then there must exist i <n and j > n such that A" = {A'Ua.j}\a.i 
is again nonsingular. But then A" also is a DE-matrix and by Lemma 4, A has 
an all 1 column, a contradiction. 



5 A Lemma on Matrices 

The following lemma contains our main argument from linear algebra. Though 
we need just a very special case of the lemma, we would like to state it in general 
form, for the sake of possible other applications. 

Suppose A,B,D G U,W G where D is nonsingular and U 

(or W) has full column rank. 

Lemma 6. det(Z7 -I- UW^) yf 0 iff det A yf 0 and then 

A^B = D + UW^ BD~^A^ = 1 + XA~^Y^, 
where A = W'^ D~^U + 1; X = BD~^U; = W’^D~^A^. 

Proof. Denote by 

-z\ 0 ■ 

0 d\ ’ 

Now if A^B = D + UW'^ , then A^X = UA; Y^B = Z\TT^; and 



F = 



Y A 



;E = 



-A -W^ 
X B 



;Dn = 



Y^x = W^D-^A^BD-'^U = W^D~^{D + UW^)D~'^U = - A. 
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It follows that B = D + UW^ <G> E = Dq. As BD~^U = U A, and U 
has full column rank, the singularity of A implies the singularity of either A or 
B. If A is nonsingular, then both F and E are nonsingular, and 

F'^E = Do^ EDg^E^ = / <^ BD-'^A^ = 1 + XA~'^Y^. 

Since A^XA~^ = U and B = A and B are also nonsingular. 

Notice that if the inverse of the matrix D is easy to compute, then Lemma 
6 reduces the singularity test of the n x n matrix D + UW^ to the singularity 
test of an m X m matrix Z\. 

Taking m=l, U=W = 1 and D = diag{d) we get: 

Lemma 7 ([7,10]). If (A,B) is a nonsindular d-design, then BD~^A^ = I + 
(5“^xy^, where x = y = Ad~^, and 6 = d^^ -1 + 1. 

It follows that if a^, is a d-row of a nonsingular d-design (A,B), then for 
each k < n, Yl’^=i^ijbkjd~"^ = Cik- On the other hand, if for some i and 
k, X)j=i o-ijbkjdj^ = Cik, or, in particular, i ^ k and a^, • b^, = 0, then ei- 
ther a^. or bfe. is a d-row. 

The following simple lemma will also be useful in the sequel. 

Lemma 8. If {A, B) is a d-design, then the columns of A and B are affinely 
independent. Moreover, if B is singular then i?d~^ = 0. 

6 Applications to the Block Design 

Lemma 6 has some important consequences in the classical design theory. In 
particular, it implies that if d has full support, then A J -I- diag{d) is nonsingular 
iff Ad~^ • 1 yf —1. This simple fact, which can be easily proved directly, was used 
by Bose [1] (see also [12]) to prove the well-known Fisher’s inequality asserting 
that in a pairwize balanced design the number of blocks is greater or equal than 
the number of points. 

Theorem 5 ([12]). [1] Let A = {Ax, . . . ,Am}, where Ax, .. . ,Am are distinct 
subsets of a set of n elements such that for each i ^ j, 1 < \Ai C) Aj\ = A < n. 
Then m < n. 

Proof. If one of the sets has A elements, then all the other sets contain this one 
and are disjoint otherwise. It follows that m < n. Hence we may suppose that 
di = \Ai\ — A > 0. Then A =diag{d) + XJ. Since Ad“^ • 1 yf —1, m < rk 
A < n. 

Here is another useful consequence of Lemma 6: 



Corollary 1. LetA,BGlZ^^^. IfA^B = XJ + {k-X)I, where \{1 - n) k, 
then 

A^B = BA^ ^ JB = rJ] JA = sJ ^ BJ = rJ; AJ = sJ. 
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Proof. By Lemma 6 we have: 

bA^ = A(A(n - 1) + k)-^BJA^ + {k- A)/. 

Hence B = BA^ BJAA = tJ, for some t BJ = rJ;AJ = sJ, where 
rs = A(n — 1) + A: JB = rJ; JA = sJ, where rs = \{n — 1) + fc (as A^ B = 

\J+{k-\)I). 

Thus, we get the following well-known result of Ryser [12]: 

Theorem 6 ([12]). The dual of a symmetric design is again a symmetric de- 
sign. 

Proof. Suppose AA^ = (fc — A)/ + XJ. Then JA = kJ and by Corollary 1, 
AJ = kJ and A^A = AA^ = {k - A)/ -k XJ. 

The following interesting fact also can be easily deduced from Lemma 6. 

Theorem 7 ([12]). In any square design, there exists a set incident to each 
given pair of points. 



7 Some Sufficient Conditions 



The following theorem completely characterizes the d-designs {A, B), where J — 
A is disconnected (the proof is omitted) . 

Theorem 8. Let (A,B) be a d-design. If it is not a DPP-design and J — A is 
disconnected, then the following cases are possible: 



1 . 



2 . 



3. 



I 



{A,B) 



/ [1 0 0 
110 
\ [l 0 1 



1 0 0 ] \ 
10 1 
1 1 oj / 



(A,B) 



(A,B)^ 



( 


'1 ef ■ 




1 o' 






1 J-7 


5 


ei I 


) 


'0 1 l' 




'1 0 l' 


IJ-I J 


5 


0/0 


[1 


0 I 




0 0/ 



{A,B) 



f 


'10 11 ' 




'110 0 ' 


\ 


0 0 11 




0 0 0 1 






110 0 


5 


10 0 0 




[ 


110 1 




0 0 11 


/ 
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Theorem 9. If {A, B) is a d-design without d-rows, and n > 3, then either 
(A,B) is an (r, s) -design or it is a DPP-design. 

Proof. As both Ad~^ and Bd~^ have full supports, it follows from Lemma 8 
that {A, B) is nonsingular. If either A or B has an all one column, then it is not 
difficult to show that n < 3. So suppose that A and B have no all one columns. 
Denote by U = {I : aij = 1}; = {k : Oik = 1}; and = B[pA^ K^]. Let 

Oij = 0. As a.j • h k = 1, for each k G AT*, and a^, • b/. > 1, for each I G U 
(by Lemma 7), it follows that each row of contains at least one 1 and each 
column of contains exactly one 1. Hence \K'^ \ >\lJ\, for each pair i and j 
such that Oij = 0. Therefore, by Lemma 2, \K'^ \=\U\, is a permutation 
matrix, and both A and B are DE-matrices. Moreover, if a^- = 0 and aij = 1, 
then a.i ■ b;, = 1. Suppose now that a^, > a/,. As by Lemma 3, A has no all one 
row, there exists a j such that aij = aij = 0. Since A is a DE-matrix, a^. = a/., 
a contradiction. Hence a^. • b;, = 1, for each i ^ 1. That is BA^ = J+ diag{d'), 
for some vector d'. 

Now, it follows from Theorem 8 and Lemma 5 that either (A, B) is a DPP- 
design, or both A and B are totally regular. Suppose A is totally r-regular and 
B is totally s-regular. Then rsJ = JA^B = nJ Jdiag{d), hence di = c ?2 • • • = 
dn = rs — n, and A^B = J (sr — n)I. Similarly, BA^ = J -\- {sr — n)I = B. 

Notice that in the proof we are using just a very special case of Theorem 8 
(when both A and B are DE-matrices), which can be easily proved directly. 

As a consequence we get the important result of Lehman on the structure of 
square minimally non-ideal matrices. 

Corollary 2 ([7]). If (A,B) is a d-design, where d > 1, then either (A,B) is 
an {r, s)-design or it is a DPP-design. 

Here are two other useful consequences: 

Corollary 3. If B = J—I, then (A, B) is an {a, lo) - design, where n = auj-\-l. 



Corollary 4. If (A, B) is a d-design, where A and B both are uniform, then it 
is an {r, s)-design. 

Proof. If A is uniform, then the solution of A^x = 1 has full support. Hence 
Hd^^ also has full support and B has no d-rows. 

For our next result we need the following nice lemma of Sebd [14]: 

Lemma 9 ([14]). If{A,B) is a d-design, where A is r-regular, then, for each 
j < n, dj = —n (modr). 

The following theorem characterizes the nonsingular d-designs (A, B), where 
A is regular: 

Theorem 10. If (A, B) is a nonsingular d-design, where A is r-regular, then 
either it is an (r, s) -design or a G-design. 
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Proof. Suppose (A,B) is not an (r, s)-design. Then by Lemma 9 and Theorem 
9, for each j, either dj = —1 or dj > r — 1, and by Theorem 9, either A or B 
has a d-row. Consider two cases: 

Case 1: ai, is a d-row. Now, it follows from Lemma 7 that for each z yf 1, 
either ai. -bi. =0 or ai. < b^,. As B has no equal columns, it follows that r = 2, 
dj = ±1, hence (A,B) is a G-design. 

Case 2: bi. is a d-row having maximum number of ones. 

Suppose bi. = (1 ... 1, 0 ... 0), where bi.l = k. Then we have that for each 
z yf 1, either a^. • bi. = 0 or a^, • bi, = r. Moreover, ai. • bi. = r — 1. It follows 
that r < k < n. Suppose a^. • bi. = r if 1 < z < I, a^, • bi. = 0 if z > ^, and 
aifc+i = 1. Denote by Ai = A[l+1 . . .n; k+1 . . .n], Bi = B[l+1 . . . n; fc-|-l . . . n]. 
As A is nonsingular, I > k. On the other hand, AfBi =diag(dk+i ■ ■ ■ dn) + J, 
where Y^^=k+i 7^ hence fc > L It follows that k = I and 

Ai is nonsingular. Hence the equation Afyi = 1 — ei has a unique solution. 
As all the columns oi B' = B[k + \ . .n]l . . .k] satisfy that equation, B' has 
an all one row. Suppose bp. is the row of B corresponding to that row of B' . 
As E"=i a2jbpjd-^ = 0, bp, is a d-row having more ones than bi., which is a 
contradiction. 

Notice that only Lemma 8 and Theorem 8 yet contain some information 
about the structure of singular designs. The characterization of singular designs 
seems to be a more difficult problem, as Lemma 6 cannot be applied directly. 
In particular, it would be interesting to check whether there exist singular d- 
designs (A, B), where A is r-regular, and r > 2. A partial answer to this question 
is given in [14]. Here is another result on that direction. The prove is similar to 
the proof of Theorem 10. 

Lemma 10. If {A, B) is a d-design, where A is r-regular and r > 1, then A is 
connected. 

8 Sharper Characterizations for (cc, o;)-Graphs 

In this section we apply Theorem 9 to get some new, smaller sets of conditions 
characterizing (a, w)-graphs. It is not difficult to deduce from Theorem 9 (see 
[3]) that G is an (a,u;)-graph iff it has a family of n cliques A and a family 
of n stable sets B such that A^ B = J — I. The following reformulation of this 
statement is a strengthening of a similar result of Hougardy and Gurvich [6] . 

Corollary 5. If G has a family of < n cliques covering all the edges of G such 
that for each vertex v G V, G\v can he partitioned with the help of these cliques, 
then G is an {a, uj) -graph. 

The following theorem provides another characterization of (a, w)-graphs. 

Theorem 11 ([4]). G is an {a,uj)-graph iff it has an a-stahle set Ai such that 
for each vertex s G A\ and stable set S C V ; x(G\s) = uj = uj{G\S). 
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Proof. Let A := {Ai, A 2 , . . . , A^uj+i}, where Ai is the stable set occurring in 
the theorem; fixing an w-coloration of each of the a graphs G\s (s G Ai), 
A 2 , . . . ,Aaui+i denote the stable sets occurring as a color-class in one of these 
colorations. Define B := {Bi, B 2 , • ■ • , Baui+i}, where Bi is an w-clique of G\Ai. 
Now it is straightforward to check that A^ B = J — I { see [4] ). Since G\s 
(s € Ai) has a partition into uj stable sets, n <auj + l\s obvious. On the other 
hand, A has full column rank, and n > aoj + 1 follows. Thus, n = aui + 1 and 
{A, B) is an (a, w)-design. The fact that B is the set of all w-cliques of G follows 
now by noticing that an arbitrary w-clique Q is disjoint from exactly one element 
Ai G A: its incidence vector is the unique solution of the equation A^x = 1 — e^. 
On the other hand, one of the columns of B also satisfies this equation. In the 
same way A is the set of all a-stable sets. 

Notice that Theorem 11 immediately implies Theorem 2 and all the proper- 
ties of minimal imperfect graphs shown by Padberg [11]. 

From the proof of Theorem 11 we get: 

Corollary 6 ([16]). Let G he a graph and So be an a-stable set in G. If n = 
auj-\-land for each vertex s G So and a-stable set S C V; x(G\s) = co = uj{G\S), 
then G is an {a, uf)- graph. 

Here is another interesting consequence of Theorem 9: 

Corollary 7 ([16]). G is partitionahle iff for some p,q>2 such that n < pq-\-l, 
G has a family of n stable sets, A, such that each vertex is in at least p of the 
sets A; and A has no sets intersecting every q-clique. 

Proof. Let B be the matrix the ith column of which is the incidence vector of a 
g-clique disjoint from the stable set corresponding to the zth column of A. Then 
A^ Bl > pqn, hence n = pq-\-l and A"^ B = J — I. 

9 Design Vertices and Facets in Polyhedra 

The following two theorems contain both Padberg’s theorem on minimally im- 
perfect polytopes [11] and Lehman’s theorem on minimally non-ideal polyhedra 
[8], and the second one also contains the part of Sebd [14], which corresponds to 
nonsingular matrix equations. In the proofs we mainly use the ideas of Lehman 
[8], Padberg [10], several results on d-designs of the present work and the follow- 
ing simple but surprising fact communicated by Sebd [14]: if P is a facet {0, 1} 
polyhedron such that, for each i <n, both P\i and P/i are {0, l}-polyhedra then 
P is full dimensional. 

Theorem 12. Suppose P is a full dimensional, vertex {0, 1} polyhedron such 
that, for each 1 < i < n, both P\i and P/i are {0, l}-polyhedra, and F is a 
non-{Q, 1} facet of P. Then F is a d-design facet. Moreover, if we denote by 
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(A,B) the d-design corresponding to F, then the following cases are possible: 
1. Either {A, B) is a DPP-design or 



{A,B) 





and then F is the unique non-{0, 1} facet of P. 

2. A is totally regular and P has at most two non-{0, 1} facets. Moreover, if B 
is nonsingular, then F either is an (r, s) -design facet or a C-design facet. 

Proof. Let _F = {xSP:a-x = 1}. As for each i, P/i is a {0, l}-polyhedron, it 
follows that a has full support. Denote by A the matrix, the iih. column of which 
is the zth vertex of F. Since a has full support, it follows that F is bounded, 
A has full row rank and has no all one column. On the other hand, as P\i is a 
{0, l}-polyhedron, it follows that for each i, affine hull {A/i) can be given with 
the help of {0, l}-constraints. Hence by Theorem 4, A is a DE-matrix. Now, it 
follows from Lemma 4 that all the neighboring facets of F are {0, 1}. F cannot 
have neighboring facets of type Xj = 0, for otherwise ai. = ef and A = I, which 
is impossible. Hence E is a d-design facet. 

Thus, by Theorem 8, either A = DPP or A is totally regular. If A = DPP 
then it is not difficult to proof that we have case 1. As P can have at most two 
parallel facets, the proof is finished by Theorem 10. 



Theorem 13. Let P be a facet {0,1} polyhedron. If for each i < n, both P\i 
and P/i are {0, l}-polyhedra then either P = P>(A), where A = DPP, or P has 
at most two fractional vertices, both of which are {r, s) -design vertices. 
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Abstract. We characterize when the intersection of a set-packing and a 
set-covering polyhedron or of their corresponding minors has a noninte- 
ger vertex. Our result is a common generalization of Lovasz’s characteri- 
zation of ‘imperfect’ and Lehman’s characterization of ‘nonideal’ systems 
of inequalities, furthermore, it includes new cases in which both types of 
inequalities occur and interact in an essential way. The proof specializes 
to a conceptually simple and short common proof for the classical cases, 
moreover, a typical corollary extracting a new case is the following: if 
the intersection of a perfect and an ideal polyhedron has a noninteger 
vertex, then they have minors whose intersection’s coefficient matrix is 
the incidentee matrix of an odd eircuit graph. 



1 Introduction 

Let A- and A- be 0-1-matrices (meaning that each entry is 0 or 1) with 
n columns. We will study the integrality of the intersection P{A-,A~) := 
P-{A~) n P-(A-) of the set-packing polytope P-{A~) = {x G M” : A- x < 
1, X > 0} and the set-covering polyhedron P-{A~) = {x € M” : A- x > 1, x > 
0}. We will speak about {A-,A~) as a system of inequalities, or simply system. 

Obviously, one can suppose that both the rows of A- and those of A- are 
incidence (‘characteristic’) vectors of a clutter, that is of a family of sets none 
of which contains the other. The sets in the clutters and their 0-1 incidence 
vectors will be confused, and with the same abuse of terminology, clutters and 
their matrix representations (where the rows are the members of the clutter) will 
not be distinguished. If A- and A- do not have equal rows, that is (explicit) 
equalities, we will say that {A-,A~) is simple. 

The constraints defining P-{A~) will be called of packing type, and those 
defining P-{A~) of covering type. A vertex of P{A-,A~) can also be classified 
to be of packing type, of covering type, or of mixed type, depending on whether 
all nonequality constraints containing the vertex are of packing type, of covering 
type, or both types occur. 

* Visiting the Research Institute for Mathematical Sciences, Kyoto University. 

R. E. Bixby, E. A. Boyd, and R. Z. Rfos-Mercado (Eds.): IPCO VI 
LNCS 1412, pp. 37-52, 1998. @ Springer-Verlag Berlin Heidelberg 1998 
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The blocker of a clutter A- (or the antiblocker of A-) is the set of inclu- 
sionwise minimal (resp. maximal) integer vectors in P-(A-) (resp. P-(A-) ). 
These are 0-1-vectors and define another clutter. 

A polyhedron in this paper is the set of all (real) solutions of a system of 
linear inequalities with integer coefficients. A polytope is a bounded polyhedron. 
For basic definitions and statements about polyhedra we refer to Schrijver [11], 
and we only repeat now shortly the definition of the terms we are using directly. 
A face of a polyhedron is a set we get if we replace certain defining inequalities 
with the equality so that the resulting polyhedron is nonempty. A polyhedron is 
integer if each of its faces contains an integer point, otherwise it is noninteger. 

If A C M", we will denote by r{X) the (linear) rank of X, and by dim(A) 
the dimension of X, meaning the rank of the differences of pairs of vectors in 
X, that is, dim(A) := r[{x — y : x,y € X}). 

If P is a polyhedron, then its faces of dimension dim(P) — 1 are called facets, 
and its faces of dimension 0, vertices. All (inclusionwise) minimal faces of P have 
the same dimension. We say that P has vertices, if this dimension is 0. 

It is easy to see that P{A- , A-) has vertices for all A-, A- . (If a minimal face 
is not of dimension 0, it contains an entire line, contradicting some non-negativity 
constraint.) So P{A- , A-) is integer if and only if it has integer vertices. 

A vertex of a full dimensional polyhedron is simplicial, if it is contained in 
exactly n facets. A simplicial vertex has n neighbouring vertices. Neighbours 
share n — 1 facets. 

If A- is empty, a combinatorial coNP characterization of the integrality of 
P{A-,A~) is well-known (Lovasz [8], Padberg [9]). If A- is empty, a recent 
result of Lehman solves the problem (Lehman [6], Seymour [12]). A common 
generalization of these could be a too modest goal: if for every i G {1, . . . ,n} 
either the i-th column of A- or that of A- is 0, then the nonintegrality of 
P{A-,A~) can be separated to the two ‘classical’ special cases. Such systems 
{A-,A~) contain both special cases, but nothing more. There are less trivial 
examples where P-(A-) and P-(A-) do not really interact in the sense that 
all fractional vertices of P-(A-, A-) are vertices of P-(A-) or of P-(A-). 

In this work we characterize when the intersection of a set-packing and a set- 
covering polyhedron or that of any of their corresponding minors is noninteger. 
The results contain the characterizations of perfect and ideal polyhedra and new 
cases involving mixed vertices. The special cases are not used and are not treated 
separately by the proof: a common proof is provided for them instead. 

Graphs G = (V,E) are always undirected, V = V{G) is the vertex-set, 
E = E{G) the edge-set; 1 is the all 1 vector of appropriate dimension. 

If X G M”, the projection of x parallel to the t-th coordinate is the vector x* = 
(xi, . . . , Xi_i, Xi+i, . . . , x„). Let us fix the notation V := {1, . . . , n}. If A C ]R", 
the projection parallel to the t-th coordinate of the set A is A* := {x* : x G A}; 
if / C P, X^ is the result of successively projecting parallel to z G / (the order 
does not matter). 

Let P := P-(A-) or P := P-(A-), and I, J CV, I H J = (j. A minor of P 
is a polyhedron P\ // J := (P n {x : Xi = 0 if z G The set I is said to be 
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deleted, whereas J is contracted. For set-packing polyhedra the contraction of J 
is the same as the deletion of J. 

It is easy to see that P- \ I/J= P-{A'~), and P- \ I/J= P-{A'~), where 
A'- , A'- arise from A- , resp. A- in a simple way: delete the columns indexed by 
I, and then delete those rows that are no more maximal, resp. minimal; for A- 
do the same with J; for A- delete the columns indexed by J and also delete all 
the rows having a 1 in at least one of these columns. Hence minors of set-packing 
or set-covering polyhedra are of the same type. 

We do not use the terms ‘contraction’ or ‘deletion’ for matrices (or clutters), 
because that would be confusing here for several reasons, one of which being that 
these operations do not only depend on the matrix (or clutter) itself. But we 
define the minors of the ordered pair {A-,A-)\ {A-,A~) \ I/J := {A'~ ,A'~), 
where /, J, A'~, H'- are as defined above. The polyhedra P-{A'~) and P-(H'~) 
will be called corresponding minors of the two polyhedra P-{A'~) and P-{A'~). 
Parallelly, for a clutter (matrix) A and v & V we define the clutter A — v := 
{A&A:v^A}onV\ {u}. 

If P = P-(A-) is integer, then P and A- are called perfect, whereas if 
P = P- (H-) is integer, P and A- are called ideal. All minors of perfect and ideal 
matrices are also ideal or perfect respectively. If a matrix is not perfect (not ideal) 
but all its proper minors are, then it is called minimal imperfect, or minimal 
nonideal respectively. It is easy to see that the family of the n — 1-tuples 

of an n-set is minimal imperfect, and it is also an easy and well-known exercise 
to show that matrices not containing such a minor (or equivalently having the 
‘dual Helly property’) can be represented as the (inclusionwise) maximal cliques 
of a graph. We will call {n = 3,4, . . . , ) minimal nongraph clutters. The 

degenerate projective plane clutters 1F„ ={{l,...,n— l},{l,n},{2,n},...,{r^ — 
l,r^}}, (n = 3,4, . . .) are minimal nonideal. 

It is easy to show that the blocker of the blocker is the original clutter. The 
antiblocker of the antiblocker of is not itself, and this is the only exception: 
it is another well-known exercise to show that the antiblocker of the antiblocker 
of a clutter that has no minor (dual Helly property), is itself. 

A graph G is called perfect or minimal imperfect if its clique-matrix is so. It 
is said to be partitionahle, if it has n = au> I vertices (a,w G IM), and for all 
V G V{G), G — V can be partitioned both into a cliques and into ut stable-sets. 
Lovasz [8] proved that minimal imperfect graphs are partitionable and Padberg 
[9] proved further properties of partitionable graphs. 

Analogous properties have been proved for nondegenerate minimal nonideal 
clutters by Lehman [6], from which we extract: a pair of clutters {A,B), where 
B is the blocker or the antiblocker of A will be called partitionahle, if they 
are defined on P :={!,..., n}, n = rs — p -\- 1, {r,s G IN, p, G 2Z, 0 < p < 
minjr, s}),/i yf 1, and for all u G P there exist sets Ai,...,Ag G A and sets 
Bi,...,Br & B such that u G Ai, . . . , A^, Hi, . . . , and both {Ai \ v, . . . ,A^\ 
u, A^+i, . . . , As} and {Bi\v, . . . ,H^\u,H^+i, . . .,Br} are partitions of P\{u|. 
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Remark 1. The clique matrix of a partitionable graph is a partitionable clutter 
with to = r,a = s, ^ = 0. 

Supposing that {A,B) is partitionable, it is easy to see that they are an- 
tiblockers of each other if and only if /r = 0, and they are blockers of each other 
if and only if /i > 2. Indeed, if {A,B) are partitionable it can be shown (Pad- 
berg [9], [10]) that A and B have exactly n members and these can be indexed 
Ai, . . An, Bx, . . . ,Bn so that \Ai n is 1 if z yf j and is ^ if z = j. These 
properties will be proved directly for general ‘minimal noninteger systems’. 

We will call A partitionable, if {A, B) is partitionable, where B is the blocker 
or the antiblocker of Al ~ we will always make clear which of the two is meant. 

Let A be partitionable. Clearly, 1/rl G P~{A) if ^ = 0, and 1/rl G P~{A) 
if /z > 2 (it is actually the unique full support noninteger vertex of P- or P-, 
for minimal nonideal or minimal imperfect polyhedra, it is the unique fractional 
vertex). Let us call this the regular vertex of P-{A), or of P-{A). The regular 
vertex of 1F„ and that of is defined as their unique fractional vertex. 

The idea of this work originates in the frustrating similarities between min- 
imal imperfect and minimal nonideal matrices and the proofs of the results. 
This similarity becomes fascinating when comparing Seymour’s proof [12] of 
Lehman’s, and Gasparyan’s direct proof [3] of Lovasz’s and Padberg’s theorems. 

Despite these similarities, the generalization has to deal with several new 
phenomena, for instance P{A- , A-) can be empty, and its dimension can also 
vary. (Antiblocking and blocking polyhedra are trivially full dimensional !) We 
will meet many other difficulties that oblige us to generalize the notions and 
arguments of the special cases - without making the solution much more compli- 
cated. The proof synthesizes polyhedral and combinatorial arguments, moreover 
a lemma involving the divisibility relations between the parameters will play a 
crucial role when mixed fractional vertices occur. 

We show now an example with mixed vertices. Surprisingly, this will be 
the only essential (‘minimal noninteger’) new example where the two types of 
inequalities interact in a nontrivial way. In a sense, a kind of ‘Strong Perfect 
Graph Gonjecture’ is true for mixed polyhedra. 

If A- U AA = E{C 2 k+i) G 2^(G) {k G IN), and neither A- nor A- is empty, 
then P{A- , A-) will be called a mixed odd circuit polyhedron, and (A-,A-) 
will be called a mixed odd circuit. The unique fractional vertex of a mixed odd 
circuit polyhedron is 1/21. 

Let now {A- ,A~) be a simple odd circuit. Let us define Bi to be the (unique) 
subset of vertices of the graph C^k+i having exactly one common vertex with 
every edge of C 2 k+i except with (z,z -|- 1); the number of common vertices of 
Bi with the edge (z,z -I- 1) is required to be zero or two depending on whether 
its incidence vector is in A- or A- respectively (z = 1, . . . ,2k + 1, z -I- 1 is 
understood mod n = 2k + 1). The neighbors of the vertex 1/2 1_ on P{A-,A~) 
are the characteristic vectors of the Bi, {i = 1, . . . , n = 2k + 1) . Follow these and 
other remarks on Cy: 

Example 1. (an odd circuit polyhedron) Let us define P{A-,A~) C IR^ with: 

Xi + Xi+x < 1 (z = 1, 2, 3, 4), Xi + Xi+x > 1 (z = 5, 6, 7; for z = 7 , z -I- 1 := 1). 
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This polyhedron remains noninteger after projecting 1: indeed, the inequality 
xy — X 2 > 0 is a sixth facet-inducing inequality (containing the vertex 1/21) 
besides the five remaining edge-inequalities. These six inequalities are linearly 
independent ! (The projection of a vertex is still a vertex if and only if the 
projection is parallel to a coordinate which is nonzero both in some set-packing 
and some set-covering facet containing the vertex.) But the new inequality is not 
0-1 ! However, a study of nonintegrality should certainly include this example. 

The vertices of P{A-,A~) are, besides (1/2)1, the sets Bi, {i = 1,...,7). 
These are the shifts of B 2 := {1,4,6} by ±1 and 0,2, and of Bq := (2,4, 6,7} 
by 0,±1. Note that the vector (0, 1, 1, 1, 0, — 1, — 1) is orthogonal to all the Bi 
{i = 1, ... ,7), whence the 7x7 matrix B whose rows are these, is singular\ 

In general, if (H-,H-) is a simple mixed odd circuit, and A- has one more 
row than A-, then l^H- — A- (defines a Chvatal-Gomory cut and) is or- 
thogonal to all the Bi~s (f = 1, . . . , 2/c -I- 1), so they are linearly dependent ! 

Linear independence of the neighbors of fractional vertices play a funda- 
mental role in the special case of Padberg [9], [10], Lehman[6], and also in Gas- 
paryan [3], [4]. Mixed odd circuits show that we have to work here without this 
condition. As a consequence we will not be able to stay within matrix terms, but 
will have to mix combinatorial and polyhedral arguments: Lemma 8 is mostly 
a self-contained lemma on matrices, where the polyhedral context, through 
Lemma 7 brings in a stronger combinatorial structure: ‘r = 2’. The matricial 
part of Lemma 8 reoccurs in papers [4] and [5] , studying the arising matrix equa- 
tions. The latter avoids the ‘nonsingularity assumption’ replacing Lemma 7 by 
combinatorial (algebraic) considerations. 

This paper is organized as follows: Section 2 states the main result, its corollaries, 
and reformulations. The proof of the main result is provided in sections 3 and 
4. Section 5 is devoted to some more examples and other comments. 

2 Results 

When this does not cause missunderstanding, we will occasionnally use the 
shorter notations := P^(A^), P^ := P^{A^), P -.= P{A^ , A^) =P<nP>. 
Recall that the polyhedra P-{A'~) := P- \ I/J and P-{A'~) := P- \ I/J, 
{I,J C V := {!,..., n}, I r\ J = 0) are called corresponding minors, and 
(A'-, A'-) =: (A-, A-) \ // J is a minor of (A-, A-). (Note that two minors are 
corresponding if and only if the two I U J are the same, since for set-packing 
polyhedra deletion is the same as contraction.) Furthermore, if for all such I, J 
the polyhedron (P-(A-) \ I/J) n (P-(A-) \ I/J) is integer, then the system 
(A-, A-) will be called fully integer. 

Theorem 1. Let A- and A- he 0-1-matrices with n columns. Then (A-, A-) 
is not fully integer if and only if it has a minor (A'~, A'~) for which at least one 
of the following three statements holds: 
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- A'- is a minimal nongraph clutter, or it is partitionahle with p, = 0, moreover 
in either case the regular vertex of P-{A'~) is in P-{A'~), and it is the 
unique packing type fractional vertex of P-{A'~) n P-{A'~). 

- A'- is a degenerate projective plane, or it is partitionahle with p>2, more- 
over in either case the regular vertex of P-{A'~) is in P-{A'~), and it is 
the unique covering type fractional vertex of P-{A'~) n P-{A'~). 

- {A'-,A'~) is a mixed odd circuit. 

Lovasz’s NP-characterization of imperfect graphs [8] (with the additional 
properties proved by Padberg[10]), follow: 

Corollary 1. Let A- he a 0-1-matrix with n columns. Then A- is imperfect 
if and only if it has either a minimal nongraph or a partitionahle minor A'~ , 
moreover P{A'~) has a unique fractional vertex. 

Specializing Theorem 1 to set-covering polyhedra one gets Lehman’s celebrated 
result [6], see also Seymour [12]: 

Corollary 2. Let A- he a 0-1 -matrix with n columns. Then A- is nonideal if 
and only if it has either a degenerate projective plane or a partitionahle minor 
A'- , moreover P{A'-) has a unique fractional vertex. 

The following two consequences are stated in a form helpful for coNP char- 
acterization theorems (see Section 5): 

Corollary 3. Let A- and A- he 0-Tmatrices with n columns. Then {A-,A~) 
is not fully integer if and only if at least one of the following statements holds: 

- A- has a minimal nongraph or a partitionahle, furthermore minimal imper- 
fect minor with its regular vertex in the eorresponding minor of P-{A~), 

- A- has a degenerate projective plane or a partitionahle minor with its regular 
vertex in the corresponding minor P-{A'-) of P-{A~), where A'- is perfect. 

- {A-,A~) has a mixed odd circuit minor. 

If we concentrate on the structural properties of the matrices A- and A- implied 
by the existence of a fractional vertex we get the following. This statement is not 
reversible: if A- consists of the maximal stable-sets of an odd antihole, and A- 
of one maximal but not maximum stable-set, then {A-,A-) is fully integer, 
although A- is minimal imperfect ! 

Corollary 4. Let A- and A- he 0-Tmatrices with n columns and assume that 
P-{A~) n P-{A~) is a noninteger polyhedron. Then 

- either A- has a minimal imperfect minor, 

- or A- has a degenerate projective plane, or a partitionahle minor, 

- or {A-,A~) has a mixed odd circuit minor. 

Note the asymmetry between ‘minimal imperfect’ in the first, and ‘partitionahle’ 
in the second case (for an explanation see 5.2). 

The results certainly provide a coNP characterization in the following case: 
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Corollary 5. Let A- he a perfect, and A- an ideal 0-1-matrix with the same 
number of columns. Then {A-,A~) is fully integer if and only if it has no mixed 
odd circuit minor. 

These results provide a certificate for the intersection of a set-covering poly- 
hedron and a set-packing polytope or of their corresponding minors to be nonin- 
teger. This certificate can be checked in polynomial time in the most interesting 
cases (see Section 5) . We will however prove Theorem 1 in the following, slightly 
sharper form which leaves the possibility to other applications open - and cor- 
responds better to our proof method: 

We call {A-,A~) comhinatorially minimal noninteger, if P := P{A- , A-) is 
noninteger, but (P- \ z) n (P- \ i) and (P-/i) n (P-/i) are fully integer for all 
i = I, . . . ,n. Clearly, mixed odd circuits have this property. 

Note the difference with the following definition which takes us out of 0-1 
constraints: P is polyhedrally minimal noninteger, if it is noninteger, but PC\{x G 
IR” : Xi = 0} and P® are integer for alH G V. 

Both the combinatorial and the polyhedral definitions require that the inter- 
section of P with each hyperplane Xi = 0 {i G V) is integer. 

The two definitions are different only in what they require from projections, 
and this is what we are going to generalize now. When we are contracting an 
element, comhinatorially minimal noninteger systems require the integrality of 
P^{A^y n P^{A^y instead of the integrality of [P^{A^) n P^{A^)Y in the 
polyhedral definition, and this is the only difference between the two. It is easy 
to see that P-{A~y n P-{A~y D [P-(A-) n P-(Gl-)]*, and we saw (see 
Example 1) that the equality does not hold in general, so the integrality of 
P-{A-ynP-{A~y and that of [P-(A-)nP-(^-)]* are seemingly independent 
of each other. The combinatorial definition looks actually rather restrictive, since 
it also requires that fixing a variable to 1 in P- (Gl-), and fixing the same variable 
to 0 in P-{A~) the intersection of the two polyhedra we get is integer. 

Note however, that surprisingly, the results confirm the opposite: the com- 
binatorial definition is less restrictive, since besides partitionable, minimal non- 
graph and degenerate projective clutters, it also includes mixed odd circuit poly- 
hedra, which are not polyhedrally minimal noninteger ! 

Our proofs will actually not use more about the projections than the fol- 
lowing simple sandwich property of P which is clearly implied by both com- 
binatorial and polyhedral minimal nonintegrality {Qi can be chosen to be the 
polyhedron on the left hand side or the one on the right hand side respectively): 
for all i = 1, ... ,n, there exists an integer polyhedron Qi such that 

[P^{A^) n p^{A^)Y cq,c p<{A^y n p^{A^y. 

Let us call the system {A-,A~) minimal noninteger, if 

- P is noninteger, and 

- PD{x€WQ :xi = 0}(= P^n{x G M” : Xi = 0}nP^n{x G M” : X* = 0}) 

is an integer polyhedron for all z G V, and 

- P has the sandwich property. 
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Theorem 2. If {A-,A~) is minimal noninteger, simple, and w G P is a frac- 
tional vertex, then P is full dimensional, w is simplicial, and at least one of the 
following statements hold: 

- w is of packing type, and then A- is either a minimal nongraph clutter, or 
the clique-matrix of a partitionable graph, 

- w is of covering type, and then A- is either a degenerate projective plane or 
a partitionable clutter, fJ, >2, 

- w is a mixed vertex, and then {A-,A~) is a mixed odd circuit. 

Moreover, P has at most one fractional vertex of covering type, at most one 
of packing type, and if it has a vertex of mixed type, then that is the unique 
fractional vertex of P. 

Note that Theorem 2 sharpens Theorem 1 in two directions: first, the con- 
straint of Theorem 2 does not speak about all minors, but only about the dele- 
tion and contraction of elements; second, the integrality after the contraction of 
elements is replaced by the sandwich property. 

The corollaries about combinatorial and polyhedral minimal nonintegrality 
satisfy the condition of Theorem 2 for two distinct reasons. In the combinatorial 
case simplicity does not necessarily hold, but deleting the certain equalities from 
A-, the system remains combinatorially minimal noninteger (see 5.2). 

Corollary 6. If{A-,A~) is combinatorially minimal noninteger, then at least 
one of the following statements holds: 

- A- is a minimal nongraph or a partitionable clutter with n = 0, furthermore 
it is minimal imperfect, and the regular vertex of P-{A~) is the unique 
packing type fractional vertex of P-{A~) n P-{A~). 

- A- is a degenerate projective plane, or a partitionable clutter with /i > 2, 
while A- is perfect, and the regular vertex of P- {A - ) is in P- {A - ) . 

- {A-,A~) is a mixed odd circuit, and 1/21 is its unique fractional vertex. 

This easily implies Theorem 1 and its corollaries using the following remark, 
(it is particularly close to Corollary 3), while the next corollary does not have 
similar consequences. This relies on the following: 

- If P is noninteger, {A-,A-) does contain a combinatorially minimal nonin- 
teger minor. (Proof: In both P- and P- delete and contract elements so that 
the intersection is still noninteger. Since the result has still 0-1 constraints 
this can be applied successively until arriving at a combinatorially minimal 
noninteger system.) 

- If P is noninteger, one does not necessarily arrive at a polyhedrally minimal 
noninteger polyhedron with deletions and restrictions of variables. (Coun- 
terexample: Example 1.) 

Corollary 7. //P-(T-) nP-(A-) is polyhedrally minimal noninteger, then at 
least one of the following statements holds: 




Characterizing Noninteger Polyhedra with 0-1 Constraints 



45 



- either A- is a minimal nongraph or a partitionahle clutter with /i = 0, and 
the regular vertex of P-{A~) is the unique packing type fractional vertex of 
P^{A^)nP^{A^). 

- or A- is a degenerate projective plane, or a partitionahle clutter with p.> 2, 
and the regular vertex of P-{A~) is in P-{A~). 

Proof. Express w’’ as a convex combination of vertices of P*. Replacing the 
vectors in this combination by their lift, we get a vector which differs from w 
exactly in the i-th coordinate (t = 1, . . . , n) - if it did not differ, w would be the 
convex combination of integer vertices of P. So the i-th unit vector is in the linear 
space generated by P for all t = 1, . . . , n, proving that P is full dimensional, in 
particular, simple. So Theorem 2 can be applied, and its third alternative cannot 
hold (see Example 1). □ 

Gasparyan [4] has deduced this statement by proving that in the polyhedral 
minimal case the matrices involved in the matrix equations are nonsingular (see 
comments concerning nonsingularity in Example 1). 

The main frame of the present paper tries to mix (the polar of) Lehman’s 
polyhedral and Padberg’s matricial approaches so as to arrive at the simplest 
possible proof. Lemmas 1-4 and Lemma 7 are more polyhedral. Lemma 5, 
Lemma 6 and Lemma 8 are matricial and combinatorial. When specializing 
these to ideal clutters, their most difficult parts fall out and quite short variants 
of proofs of Lehman’s or Padberg’s theorem are at hand. 

3 Prom Polyhedra to Combinatorics 

The notation A, B will be used for families of sets. (We will also use the notation 
A for the matrices whose rows are the members of A.) The degree d^(u) of v in 
A is the number of A € A containing v. 

Given w G P, let Aw be the set of those rows A of A- or of A- for which 
w(^) = 1. (We do not give multiplicites to the members of Aw, regardless of 
whether some of its elements are contained in both A- and A- !) We also define 
these if the polyhedron also has non-O-l-constraints. Then A- and A- denote 
the set-covering and set-packing inequalities in the defining system. 

If P is integer, we define Bw as the family of those 0-1 vectors (vertices of P) 
which are on the minimal face of P containing w. (Equivalently, Bw is the set of 
vertices having a nonzero coefficient in some convex combination expressing w.) 
If Ag Aw, and B G Bw, then |^nR| = 1. Glearly, r{Aw) + r{Bw) = dimP-|-l. If 
it is necessary in order to avoid misunderstanding, we will write Aw{P), Bw{P). 

The following lemma is based on the polar (in the sense of interchanging ver- 
tices and facets) of a statement implicit in arguments of Lehman’s and Seymour’s 
work (see Seymour [12] ). 

Lemma 1. If Q is a polyhedron with 0-1 vertices (and not necessarily 0-1- 
constraints) and w G Q, w > 0, then Ub6B„ B = V , and 

r{Bw) > max { j^j : ^ G Aw^, r{Aw) <n — max {j^j : A G Aw^ + 1. 
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Proof. Indeed, since Q is integer, w is the convex combination of 0-1 vertices in 
Bw, whence = V .In particular, for A G Aw and all a G A there exists 

Ba & Bw, such that a G Ba. 

Since A G Aw, and Ba G Bw, we have \A(^ Ba\ = 1, and consequently 
A(^ B a = {a}. Thus {Ba '. a G A} consists of |A| linearly independent sets of 
Bw, whence r{Bw) > \A\. □ 



Remark 2. Compare Lemma 1 with Fonlupt, Sebo [2]: a graph is perfect if and 
only if the linear rank of the maximum cliques (as vertex-sets) in every induced 
subgraph is at most n — ui + \ where oj is the size of the maximum clique in the 
subgraph; the equality holds if and only if the subgraph is uniquely colorable. 

We note and use in the sequel without reference that if P is minimal non- 
integer, then w > 0 for all fractional vertices w of P {wi = 0 would imply that 
(P- \ t) n (P- \ i) is also noninteger). 

In sections 3 and 4 / will denote the identity matrix, J the all 1 matrix 
of appropriate dimensions; A is called r-regular, if lA = rl, and r-uniform if 
A\ = rl; := {V \ A : A G A}. A is said to be connected if V cannot be 
partitioned into two nonempty classes so that every A G A is a subset of one of 
the two classes. There is a unique way of partitioning A and V into the connected 
components of A. 

Lemma 2. If{A-,A~) is minimal noninteger, w is a fractional vertex of P := 
P(A^,A^), and A C A w T>s a set of n linearly independent members of j^w, 
then every connected component K of A’^ is n — rK -regular and n — vk - uniform 
(rx G INj, arid r{A — v) = n — d^iv). 

Proof. Recall that w > 0. If P is minimal noninteger, then for arbitrary i G V the 
sandwich property provides us Qi C w* G [P-(A-)nP-(A-)] * C Qi C 

P-(A-)* n P-{A~y, that is, w* G Qi and w* > 0. Applying the inequality in 
Lemma 1 to Qi and w*, and using the trivial but crucial fact that Awi {Qi) A A—i, 
we get the inequality r(A — i) <n — max { | A| : A G A — i}. 

On the other hand, r{A) = n by assumption. One can now finish in a few 
lines like Conway proves de Bruijn and Erdos’s theorem [7], which is actually 
the same as Seymour [12, Lemma 3.2]: 

Let H := A'’ for the simplicity of the notation. What we have proved so far 
translates as dn{v) < \H\ for dll v G PI GPL. But then, 

E 1= E E vii?i = E E ^m = Y.dn{v)i\H\<Y.^^ 

H&n HgT-CvGH vGV HgT-C,vGH vev vGV 

and the equality follows. □ 

Remark 3. The situation of the above proof will be still repeated several times: 
when applying Lemma 1, the 0-1 vectors that have an important auxiliary role 
for bounding the rank of some sets are in Bw{Qi), and are not necessarily vertices 
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of P. The reader can check on mixed odd circuits that the neighbors B = 
{Bi, , Bn} of 1/2 1_ are not suitable for the same task (unlike in the special 
cases): the combinatorial ways that use B had to be replaced by this more general 
polyhedral argument. Watch for the same technique in Lemma 7 ! 

The next lemma synthesizes two similar proofs occurring in the special cases: 

Lemma 3. If {A-,A~) is minimal noninteger, w and w' are fractional vertices 
of P, then defining A and A! to he a set of n linearly independent vectors from 
Aw, Aw' respectively, A and Af cannot have exactly n — 1 common elements. 

Proof. (Sketch) Apply Lemma 6 to both w and w'. With the exception of some 
degenerate cases easy to handle, any member of an r-regular clutter can be 
uniquely reconstructed from the others. □ 

Lemma 4. If{A-,A~) is minimal noninteger and simple, andw is a fractional 
vertex of P, then P := P{A- , A-) is full dimensional, w is simplicial, and the 
vertices neighbouring w on P are integer. 

The proof can be summarized with the sentence: a minimal noninteger system 
cannot contain implicit equalities (only ‘explicit’ equalities). 

Proof. Let us first prove that P is full dimensional. By Lemma 3 Aw is linearly 
independent (recall that every member was included only once) . Suppose 0 G M” 
can be written as a nontrivial nonnegative linear combination of valid inequal- 
ities. Clearly, all of these are implicit equalities (see [11]) of P. In particular 
their coefficient vectors are in Aw- In this nontrivial nonnegative combination 
there is no nonnegativity constraint Xi > 0, because otherwise P C }x : Xi = 0}, 
contradicting w > 0. So everything participating in it is in Aw contradicting its 
linear independence. 

Since Aw is linearly independent, w is simplicial. If a neighbour w' of w is 
noninteger, we arrive at a contradiction with Lemma 3. □ 

We will say that a polyhedron P is minimal noninteger if P = P{A-,A~) 
for some simple, minimal noninteger system. (Since P is full dimensional by 
Lemma 4, it determines {A-,A-) uniquely.) 

Given a minimal noninteger polyhedron P and a fractional vertex w of P, 
fix A := A(w) := Aw and let B := B{w) denote the set of vertices neighboring 
w in P. 

Note that U”_]^P„i(P*) = B{w) holds in the polyhedrally minimal noninteger 
case, but does not necessarily hold otherwise, and therefore we need essential 
generalizations. Do not confuse Bw (which is just {w}) with B{w). 

We will say that a vertex B G B and a facet A G A not containing it are asso- 
ciates . By Lemma 4 w is simplicial, whence this relation perfectly matches A and 
B. We will suppose that the associate of the z-th row Ai of A is the associate Bi of 
A,; jjL^ := \Ai n Bi\. Clearly, yf 1 (z = 1, . . . , n). Denoting by diag(di, . . . , d„) 
the nx n diagonal matrix whose diagonal entries are di, d 2 , . . . , dn, we have 
proved: 
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Lemma 5. = J + diag(/zi — 1, . . . , — 1), where {i = 1, . . . ,n). 

If fii does not depend on i, we will simply denote it by /i. (This notation is 
not a coincidence: in this case {A, B) turns out to be partitionable where /i is 
the identically denoted parameter.) By Lemma 5, /i yf 1. 

The main content of Lemma 3, 5, some aspects of Lemma 4 and most of 
Lemma 6 are already implicitly present already in Padberg[9]. 

4 Associates and the Divisibility Lemma 

The following lemma extracts and adapts to our needs well-known statements 
from Lehman’s, Seymour’s and Padberg’s works, and reorganizes these into one 
statement. It can also be deduced by combining results of Gasparyan [4], which 
investigate combinatorial properties implied by matrix equations. For instance 
the connectivity property of Lemma 6 below is stated in [4] in a general self- 
contained combinatorial setting. 

Lemma 6. If P is minimal noninteger, and w G P is a fractional vertex of P, 
then A = A(w) is nonsingular and connected, moreover, 

- if the clutter A'^ is connected, then I A = Al = rl, r >2. 

- if the clutter A'^ has two components, then A is a degenerate projective plane. 

- if the clutter A'^ has at least three components, then A = 

Proof. (Sketch) If A^ has at least two components, then any two sets whose 
complements are in different components cover V. This, and the matrix equation 
of Lemma 5 determine a degenerate combinatorial structure. (For instance one 
can immediately see that the associate of a third set has cardinality at most two, 
and it follows that all but at most one members of B have at most two elements.) 

If A'^ has one component, then the uniformity and regularity of A^ claimed 
by Lemma 2 implies that of A. □ 

Recall that the nonsingularity of B cannot be added to Lemma 6 ! 

It is well-known that both for minimal imperfect and minimal nonideal ma- 
trices the associates of intersecting sets are (almost) disjoint. In our case they 
can also contain one another, and the proof does not fit into the combinatorial 
properties we have established (namely Lemma 5). We have to go back to our 
polyhedral context (established in the proof of Lemma 2, see also Remark 3): 
Let us say that A with AB"'" = J + diag(^i — 1, . . . , — 1), where yf 1 

(i = 1, . . . , n) is nice, if for A\,A 2 G A, v G Ai(^ A^ the associates Ri, R 2 G B oi 
Al and A 2 respectively, either satisfy Ri n R 2 \ = 0 or Ri \ {u} = R 2 \ {«}. 

(In the latter case, since Bi and B 2 cannot be equal, one of the two contains v.) 



Lemma 7. Let P be minimal noninteger, and A = A{w), B = B{w) for some 
noninteger vertex w G P. Then A is nice. 
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Check the statement for the mixed Cy of Example 1 ! (It can also be instructive 
to follow the proof on this example. ) 

Proof. Let v & A\, A 2 & A, and let Si,i ?2 G .8 be their associates. Moreover 
assume u G i?i n i ?2 \ {^'}• Let Aq & A, u & A, v ^ Aq. (There exists such an 
Aq € A since for instance Lemma 6 implies that a column of A cannot dominate 
another.) Since P is minimal noninteger, there exists an integer polyhedron Qy 
such that [P^(A^) n P^{A'^)Y C C p< D P^ {A'^)\ Now because of 
Aw«{Qv) A — i, the scalar product of the vertices of Byjv(Qy) with all vectors 
in M — z is 1, and the proof method of Lemma 1 can be applied: 

For every a G ylo \ zt fix some Ba G Bw«{Qv) so that a G Ba- Now {Ba ■ 
a G ^0 \ m} U {B\ \ V, B 2 \ u} are r + 1 vectors in 1R^^“ all of which have 
exactly one common element with each A C A — v. On the other hand, by 
Lemma 2 r{A — v) = n — r = (n — 1) — (r — 1), so there can be at most 
r linearly independent sets with this property. Hence there exists a nontrivial 
linear combination Ai(Hi\z;) + A2(i32\z;)+X)a6A\ii ^a.Ba = 0. Since for a G g1o\m 
the unique vector in this linear combination which contains a is Ba, one gets 
that Aa = 0 for all a G A\u. It follows that Bi\v = B 2 \v, and Ai = A 2 . □ 

Although the following statement is the heart of our proof, it is independent 
of the other results. The very root of the statement is the simple observation 
that n + dj is a multiple of r. Note that in order to deduce r = 2 we need more 
than just the matrix equation ! 

Lemma 8. Assume that A, B are 0-1 matrices, lA = Al = rl, and 
AB'^ = J + diag(^i — 1, . . . , /x„ — 1), fii 1. Then 

- either /ii = . . . = =: /z, and then AB"^ = B'^A = J + (/i — 1)/, BJ = 

JB = sJ, (s = (n + ^ — l)/r), 

- or {/ii, . . . , /z„} = {0, r}, and if A is connected and nice, then r = 2. 

Proof. If /ii = /i 2 = ... = /in =: /i, then we finish easily, like [3]: since /z yf 0, 
A is invertible; since A commutes with I, and by assumption with J too, so 
does its inverse; now expressing B'^ from AB"^ = J + (/t — 1)/ we get that it 
is the product of two matrices which commute with both A and J. So 8^ also 
commutes with these matrices, proving the statement concerning this case. (The 
matrices X and Y are said to commute, if XY = YX.) 

So suppose that there exist i,j€V such that /Zi yf /Zj. 

Claim (1). r\Bj | = n + /Zj — 1, and 0 < /zy < r, (j = 1, . . . , n). 

Indeed, = {\A)B'^ = 1(A8^) = l(j + diag(/zi — I, . . . , /z„ — I)) = 

(n + /zi - 1, . . . ,n + /z„ - I). 

The inequality is obvious: 0 < /zy = |Ay n Hy | < |Ay | = r, (j = 1, . . . , n). 



Claim (2). If there exist i,j G E, /Zi yf /zy, then /zy G {0,r} for all j G V. 
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Indeed, according to Claim (1) we have n + fXj — 1 = 0 mod r, where fj,j 
is in an interval of r + 1 consecutive integers representing every residue class 
mod r exactly once, except 0, which is represented twice, by 0 and r. Hence if 
{/xi, . . . , /i„} contains two different values, then these values can only be 0 and 
r as claimed. 

Claim (3). If f € Hi n A 2 , /ii = 0, ^2 = r, then Hi = H 2 \ {f}. 

Indeed, let w G H 2 n Hi. (Because of the matrix equation in the constraint, 
we also know IH 2 n Hi | = 1.) We have |Hi n Hi | = ^1 = 0, and since IH 2 C H 2 I = 
= r = IH 2 I, we also have A 2 C H 2 . 

Since w G Hi and Hi n Hi = 0 : m yf u. Because of H 2 C H 2 we have 
u G (Hi n H 2 ) \ {w}. So we must have Hi \ {u} = H 2 \ {w} by the condition, and 
since v ^ Bi, v & B 2 : Hi = H 2 \ {f}. The claim is proved. 

Now we finish the proof. Since there exist i,j G V such that yf /ij, by 
Claim (2) fij G {0,r} for all j G V. Since H is a connected clutter, there exists 
V G V so that u G Hi n Aj and = 0, = r. After possible renumbering, we 

can assume i = 1, j = 2. 

So let Hi,H 2 G H = A{w), u G Hi n H 2 , fii = 0, fi 2 = r and denote the 
associates of Hi, H 2 by Hi, H 2 respectively. 

By Claim (3), 1 = IH 2 C Hi| = IH 2 C H 2 I — 1 = r — 1, so r = 2. □ 

Proof of Theorem 2. (Sketch) Let (H-,H-) be minimal noninteger. Further- 
more, let ic G H a fractional vertex of P. Let A := H(w) and B := B{w). Then 
we have the matrix equation of Lemma 5. 

Case 1. A‘^ is connected: according to Lemma 6 and Lemma 7 the conditions of 
Lemma 8 are satisfied, and using Lemma 8 it is straightforward to finish. 



Case 2. has two components: by Lemma 6 H is a degenerate projective plane. 
It can be checked then that either A = H- or A is not minimal noninteger. We 
prove this with the following technique (and use similar arguments repeatedly 
in the sequel): we prove first that there exist an x G C so that (P-/i) C (P-/i) is 
noninteger. It turns out then that the maximum p of the sum of the coordiates of 
a vector on (H-/x) H {P- /i) and the maximum q of the same objective function 
on H* are close to each other: [p, q] does not contain any integer (we omit the 
details). So for all Qi such that H* C Qx C (P-/i) n (P-/i) the maximum of 
the sum of coordinates on Qi must lie in the interval [p, q] . Thus Qi cannot be 
chosen to be integer, whence P does not have the sandwich property. 



Case 3. H'^ has at least three components: by Lemma 6 H is the set of n — 1- 
tuples of an n-set. If H = H-, then we are done (again the first statement holds 
in the theorem) . In all the other cases P turns out not to be minimal noninteger 
(with the above-described technique). □ 
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5 Comments 

5.1 Further Examples 

A system (A-,A-) for which a P(A-,A-) C ]R® is integer, but is 

not fully integer: the rows of A- are (1, 1, 0, 0, 0), (0, 1, 1, 0, 0), (1, 0, 1, 0, 0) and 
(0, 0, 1, 1, 1); A- consists of only one row, (0, 0, 0, 1, 1). 

We mention that a class of minimal noninteger simple systems {A- , A-) with 
the property that {A-,A~) \i {i G V) defines an integer, but not always fully 
integer polyhedron, can be defined with the help of ‘circular’ minimal imperfect 
and minimal nonideal systems (see Cornuejols and Novick [1]): define A- := C((, 
A- := C^, where r < s and A- is minimal imperfect. A- is minimal nonideal. 

Such examples do not have mixed vertices, so they also show that the first 
two cases of our results can both occur in the same polyhedron. 

5.2 A Polynomial Certificate 

We sketch why Corollary 6 follows from Theorem 2. Note that Corollary 6 im- 
mediately implies Corollary 3. 

In a combinatorially minimal noninteger system (A-,A-), A- is in fact 
minimal imperfect or perfect. This is a simple consequence of the following: 

Claim. If P- := P-{A~) or P- := P-{A~) is partitionable with a regular 
vertex w G P := P- C P-, and P-/I (/ C V) is partitionable with regular 
vertex w' , then w' G P- /I. 

Indeed, suppose that w is the regular vertex of a polyhedron whose defining 
clutter has parameters (r, s), and let the parameters of w' be (r', s'). So w := 1/rl 
and w' := 1/r'l. 

Now r' < r, because the row-sums of the defining matrix of P- // (which is 
a submatrix of A-) do not exceed the row-sums of A-. Since w € P-{A~), the 
row-sums of A- are at most r. 

But then, if we replace in l/rl_ some coordinates by 1/r' some others by 1 
the vector w" we get majorates \/r\G P-{A~) whence it is also in P-{A~). 
Since w' G P-/I is equivalent to the belonging to P- of such a vector w" , the 
claim is proved. 

To finish the proof of Corollary 6 one can show that after deleting from A- 
an equality from ^Aw\ the system remains minimal noninteger. 

Using appropriate oracles. Corollary 3 provides a polynomial certificate. (For 
the right assumptions about providing the data and certifying the parameters of 
a partitionable clutter we refere to Seymour [12]. We need an additional oracle 
for the set-covering part.) 

The polynomial certificates can be proved from the Claim using the fact that 
for partitionable clutters and perfect graphs the parameters can be certified in 
polynomial time. 

For the non-full-integrality of the intersection of perfect and ideal polyhedra 
a simple polynomial certificate is provided by Corollary 5. 
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Abstract. An important theorem due to Truemper characterizes the 
graphs whose edges can be labelled so that all chordless cycles have 
prescribed parities. This theorem has since proved an essential tool in 
the study of balanced matrices, graphs with no even length chordless 
cycle and graphs with no odd length chordless cycle of length greater 
than 3. In this paper we prove this theorem in a novel and elementary 
way and we derive some of its consequences. In particular, we show how 
to obtain Tutte’s characterization of regular matrices. 



1 Truemper’s Theorem 

Let /3 be a 0,1 vector indexed by the chordless cycles of an undirected graph 
G = {V,E). G is P-balanceable if its edges can be labelled with labels 0 and 1 
such that 1{G) = f3c mod 2 for every chordless cycle G of G, where l{e) is the 
label of edge e and 1{C) = ^e^E(C) ^(®)- 

We denote by (3^ the restriction of the vector /3 to the chordless cycles of an 
induced subgraph H of G. 

In [14] Truemper showed the following theorem: 

Theorem 1. A graph G is (3-balanceable if and only if every induced subgraph 
H of type (a), (b), (c) and (d) (Figure 1) is (3^ -balanceable. 

Graphs of type (a), (b) or (c) are referred to as 3-path configurations (SPG’s). 
A graph of type (a) is called a 3PG{x, y) where node x and node y are connected 
by three internally disjoint paths Pi^P^ and P 3 . A graph of type (b) is called 
a 3>PG{xyz,u), where xyz is a triangle and Pi, P 2 and P 3 are three internally 
disjoint paths with endnodes x, y and 2 respectively and a common endnode 
u. A graph of type (c) is called a 3PC{xyz,uvw), consists of two node disjoint 
triangles xyz and uvw and disjoint paths Pi, P 2 and P 3 with endnodes x and 
u, y and v and 2 and w respectively. In all three cases the nodes of Pi U Pj 
i j induce a chordless cycle. This implies that all paths Pi, P 2 , P 3 of (a) have 
length greater than one. Graphs of type (d) are wheels {H, x). These consist of a 
chordless cycle H called the rim together with a node x called the center, that 
has at least three neighbors on the cycle. Note that a graph of type (b) may also 
be a wheel. 

* Supported in part by a grant from Gruppo Nazionale Delle Ricerche-CNR. 
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Fig. 1. 3-path configurations and wheel 



In this paper, we give an alternative simple proof of Theorem 1 and we 
highlight its importance by deriving some well known theorems, such as the 
Tutte’s characterization of regular matrices, the characterization of balanceable 
matrices and of even and odd signable graphs. Finally we show how to use 
Theorem 1 to obtain decomposition theorems. 

Truemper [14] derived the excluded minor characterization of matroids rep- 
resentable over GF3 as a consequence of Theorem 1. From known results, see 
[13], this implies Tutte’s theorem. Here we offer a more direct derivation. 

First some definitions. N(v) is the set of neighbors of node v. A signed graph 
G is a graph whose edges are labeled with 0 or 1. Given a 0, 1 vector f3 indexed by 
the chordless cycles of G, if a chordless cycle G of G satisfies 1{C) = Pc mod 2, 
then C is signed correctly, otherwise G is signed incorrectly. A P -balancing of G 
is one for which each of its chordless cycles is correctly signed. The operation of 
scaling on a cut <5(S') of G consists of switching (from 0 to 1 and viceversa) the 
labels on all edges in S{S). Since cuts and cycles of G have even intersections, 
we have the following: 

Remark 2. Let G' he a signed graph obtained from G by scaling on a cut. A 
chordless cycle G is correctly signed in G' if and only if G is correctly signed in 
G. 



Assume G is connected and contains a clique cutset Ki with I nodes and let 
G'l , G 2 , . . . , G(j be the components of the subgraph induced by V{G) \Ki. The 
blocks of G are the subgraphs Gi induced by V (G') U Ki, i = 1, . . . ,n. 

Remark 3. If G contains a Ki or K 2 cutset, then G is P-halanceahle if and 
only if all of its blocks are P'^‘ -balanceable. 

Proof: The ’’only if’ part is obvious. We prove the ”if’ statement when G has 
a K 2 cutset {u, u}, since the other case is again immediate. All blocks have a 
/3-balancing in which edge uv has the same label, since all blocks Gi are /3®*- 
balanceable and we can always scale on a cut of Gi separating u and v. The 
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signings of the blocks induce a /3-balancing of G, since every chordless cycle of 
G belongs to one of the blocks Gi. □ 

The following observation appears in [14]. 

Remark 4. A graph G which is a wheel or a 3-path configuration is j3-halanceahle 
if and only if any signing of G produces an even number of incorrectly signed 
chordless cycles. 

Proof: If G is a wheel or a 3-path configuration, every edge of G belongs to 
exactly two chordless cycles. Therefore switching the label of edge uv changes 
the parities of the labels of the two chordless cycles containing uv and of no 
other chordless cycle. Now if G is a wheel or a 3-path configuration and has 
at least two chordless cycles that are signed incorrectly, then either G contains 
two chordless cycles that are signed incorrectly and have a common edge or G 
contains three chordless cycles Gi, G 2 and G 3 such that Gi and G 3 are signed 
incorrectly, G 2 is signed correctly and G 2 has common edges with both Gi and 
G 3 , but Gi and G 3 do not have a common edge. 

Therefore by switching the label of at most two edges, the number of cycles 
that are incorrectly signed decreases by two. □ 

An ordering ei, . . . , e„ of the edges of a connected graph G is consistent if 
the first edges in the sequence belong to a spanning tree T of G and all other 
edges Cj have the property that Cj, together with some edges having smaller 
indices, closes a chordless cycle Cj of G. Note that for any spanning tree T of 
G, G admits a consistent ordering where the edges of T appear first. 

Assume a connected graph G, a vector (3 are given and consider the following 
signing algorithm: 

Let ei,...,en he a consistent ordering of the edges of G. Label the edges of T 
arbitrarily and label the remaining edges Cj so that the chordless cycles Cj are 
signed in accordance with the components Pcj of /3. 

Since every edge of T belongs to a cut of G, containing no other edge of T, 
then Remark 2 shows that if G is /3-balanceable, an arbitrary labeling of the 
edges of T can always be extended to a /3-balancing of G. Therefore the above 
signing algorithm will produce a /3-balancing of G, whenever G is /3-balanceable. 
Conversely, given a consistent ordering where a tree T appears first and a /3- 
balancing of G, this same signing of G is produced by the algorithm when T is 
signed as in the signing of G. 

Remark 5. Let G be a /3-balanceable graph and let Gy be the subgraph of G, 
obtained by removing node v. Then every /3-balancing of Gy with respect to /3®” 
can be extended to a /3-balancing of G. 

Proof: We assume that v is not a cutnode of G, else by Remark 3, we can argue 
on the blocks of G. Now G contains a spanning tree T where u is a leaf of T. 
Order the neighbors of u in G„ as vq,vi, . . . ,Vk, where vq is the neighbor of v 
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in T and Vi is chosen so that amongst all nodes in N{v) \ {uq, . . . , Ui-i} the 
path between nodes v and Vi is shortest in the subgraph of G with edge set 
E{Gy) U {vvo, . . . ,vvi-i}. Now place first the edges of T, then the other edges 
of Gv in a consistent ordering with respect to T \ {u}, then vvi, . . . ,vvk- This 
ordering is a consistent ordering for G and the signing algorithm can be applied 
to produce from the /3-balancing of G„, a /3-balancing of G. □ 



Lemma 6. Let G' be an induced subgraph of G, containing a given chordless 
cycle G and satisfying the following properties: 

1) G' is connected and contains no K\ or K 2 cutset. 

2) G belongs to G' and G' \G is nonempty. 

3) V{G') is minimal with respect to 1) and 2). 

Then G' is a 3-path configuration or a wheel containing G . 

Proof: Let G" be the subgraph of G', induced by K(G') \ y(G). If G" is a single 
node, say u, and u has only two neighbors Ci and Cj in G, then Ci and Cj are 
nonadjacent and G' is a 3PG{ci,Cj). Otherwise G' is a wheel with u as center. 

If G" contains more than one node, by 3) we have that G" is connected and 
that: 

4) Every node of G" has at most two neighbors in G and these two neighbors 
are adjacent. 

5) G" contains at most one pair of nodes, say X\ and Xn such that both xi and 
Xn have neighbors in G and (N{xi) U N{xn)) H V (G) either contains at least 
three nodes or two nonadjacent nodes. 

(Indeed by 3), we have that G" is connected. So if G" contains more that one 
such pair, let xi, Xn be chosen satisfying 5) and closest in G". Let P = x\, . . . ,Xn 
be a shortest path in G" connecting them. The subgraph G* of G, induced by 
V{G) U V{P) satisfies 1) and 2). Then if more that one such pair exists, G* is a 
proper subgraph of G' and this contradicts 3).) 

Let G = Cl, . . . , Cm and assume first that G" contains one pair of nodes, x\, 
Xn satisfying 5). Then by 3), G" is a path P = xi,. . . , x„. If a node of G, say 
Ci, is adjacent to a node Xi, 1 < i < n of P, then by 3) and 4), xi is adjacent to 
Ci-i, possibly to c, and no other node of G. Node Xn is adjacent to Cj+i, possibly 
to Ci (indices modm) and no other node of G. Therefore no other node of G is 
adjacent to an intermediate node of P. In this case, G' is a wheel with center c,. 

If no node of G is adjacent to an intermediate node of P, then by 4) we can 
assume w.l.o.g. that xi is adjacent to Ci_i and possibly Ci and x„ is adjacent to 
Cj+i and possibly cj. If xi or x„ has two neighbors in G and i = j, then G' is a 
wheel with center Cj. In the remaining cases G' is a 3-path configuration. 

If G" contains no pair of nodes satisfying 5), by I) and 4) we have that G is 
a triangle ci, C 2 , C 3 , all three nodes of G have neighbors in G" and no node of G" 
has more than one neighbor in G. If G" is a chordless path P = x\, . . . ,Xn with 
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xi adjacent to, say ci and adjacent to C2, then C3 has some neighbor in P and 
G' is a wheel with center C3. Otherwise let P12 be a shortest path connecting ci 
and C2 and whose intermediate nodes are in G" . If C3 has a neighbor in P12 we 
have a wheel with center C3. Otherwise let P3 be a shortest path connecting C3 
and V (P12) \ {ci, C2} and whose intermediate nodes are in G” . By 3), G' is made 
up by G, together with P12 and P3, furthermore P3 meets P12 either in a node 
X or in two adjacent nodes In the first case, we have a 3PC(ciC2C3, a;), 

otherwise we have a 3PC(ciC2C3, tit20)- 

For e e E{G), G® denotes the graph whose node set represents the chordless 
cycles of G containing e and whose edges are the pairs Gi,G2 in V®(G®), such 
that Gi and G2 belong to a 3-path configuration or a wheel. 

Lemma 7. If e = {u,f} is not a K2 cutset of G, G® is connected. 

Proof: Assume not. Let Gf and G2 be two components of G®. Let Gi be the 
subgraph of G induced by the node set Uc 6G'1^(G), for i = 1, 2. 

Assume first that {m, u} is a K2 cutset separating Gi from G2 in the graph 
induced by V{G\) U V(G2). Pick Gi € Gf and G2 G G2 and a path P in G such 
that in the subgraph G' of G induced by V{G\) U V{C2) U V{P), {m, u} is not 
a K2 cutset and Gi,G2 and P are chosen so that |P| is minimized. (Note that 
P exists since {m, u} is not a K2 cutset of G). Then by the minimality of P, no 
node of P is contained in a chordless cycle containing edge e. By Lemma 6, Gi 
is a chordless cycle in a 3-path configuration or wheel H, contained in G' . Since 
any edge in a 3-path configuration or wheel is contained in two chordless cycles, 
V{Ci) U V{C2) C V{H). But then G1G2 is an edge of G®, a contradiction. 

So {u,v} is not a K2 cutset in the graph induced by V{Gi) U V{G2). Let 
G2 G G2, such that for some G G Gf, {u,v} is not a K2 cutset in the graph 
induced by V (G) U V (G2). Let C2 he u = vi, . . . ,Vm = v- Let be the node of 
lowest index in V (C2) \ V (G) and let Sc be the component of the graph induced 
by V (G2) \ V (G) containing node . Amongst all C G GI such that {u, u} is not 

a K2 cutset in the graph induced by V (G) U V (G2), let Gi be the chordless cycle 
for which the node has the highest index and with respect to that | S'ci I is 
smallest possible. By Lemma 6, Gi is a chordless cycle of a 3-path configuration 
or wheel H contained in V{Gi) U y(S'ci)- Let G3 be the chordless cycle of H 
distinct from Gi containing edge e. We show that C3 contradicts the choice of 
Gi. Since H contains Gi and G3, G3 G Gf . Also G2 and G3 have a common node 
which is distinct from u or v and so uv is not a K2 cutset in the subgraph of G, 
induced by V{G3) U V{C2). If is contained in V{G3) then has an index 
higher than i, a contradiction, otherwise since Sc^ Q Sci and some node of Sci 
belongs to C3, jS'cal < |5'ci|, a contradiction. □ 

Proof of Theorem 1: The necessity of the condition is obvious. We prove the 
sufficiency by contradiction. Assume that G and /? are chosen so that G is a 
counterexample to the theorem with respect to f3 and V{G) is minimal. Then G 
is connected and by Remark 5 G contains no K\ or K2 cutset. 
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Let e = uv he any edge of G and let = G\u, Gy = G\v and Guv = 
G\ {t6, ?;}. By the minimality of graph G, G„, G„ and Guv are respectively 

and balanceable and Remark 3 shows that a /3-balancing of Guv can 

be extended to a /3-balancing of G„ and to a /3-balancing of G„. To complete 
the signing of G, label uv arbitrarily. Now we have signed G so that: 

Every chordless cycle of G which is incorrectly signed contains edge e = uv. 

Let the chordless cycles of G containing edge uv be partitioned into the 
incorrectly signed B and the correctly signed C. Both sets are nonempty, else, 
by possibly switching the label of uv, we have a /3-balancing of G. Furthermore 
{u,v} is not a K 2 cutset of G. 

Hence by connectivity of G® (Lemma 7), there exists an edge G 1 G 2 in G®, 
where Gi G B and C 2 G C. Any edge in a 3-path configuration or a wheel is 
contained in exactly two chordless cycles, thus G contains a 3-path configuration 
or a wheel with exactly one chordless cycle in B and by Remark 4, we are done. 
□ 

2 Even and Odd-Signable Graphs 

A hole in an undirected graph G is a chordless cycle of length greater than three. 
Signed graphs provide a useful setting for studying graphs without even or odd 
holes. 

A graph G is even-signable if G is /3-balanceable for the vector /3c = 1 if G is 
a triangle of G and /3c = 0 if G is a hole of G. A graph G is odd-signable if G is 
/3-balanceable for the vector /3 of all ones. Even-signable graphs were introduced 
in [6] and odd-signable in [4]. 

Note that G contains no odd hole if and only if G is even-signable with all 
labels equal to one and G contains no even hole if and only if G is odd-signable 
with all labels equal to one. 

A graph of type (b) may also be a wheel of type (d), when at least one of 
the paths Pi, P2, P3 is of length one. To separate these cases, from now on, we 
impose that all three paths in graph of type (b) have length greater than one. 
With this assumption, all chordless cycles of graphs of type (a), (b) and (c) are 
holes except the triangle of (b) and the two triangles of (c). Furthermore the rim 
of a wheel is a hole unless the wheel is K 4 . 

We now derive from Theorem 1 co-NP characterizations of even-signable and 
odd-signable graphs. 

For graphs of type (d) (the wheels), when the center together with the nodes 
of the hole induces an odd number of triangles the wheel is called an odd wheel. 
When the center has an even number of neighbors on the hole the wheel is called 
an even wheel. (Notice that a wheel may be both odd and even and K 4 is a wheel 
that is neither even nor odd). 

In a signed graph G the weight of a subgraph H is the sum of the labels of 
the edges contained in H. 

Theorem 8. A graph is even-signable if and only if it contains no 3PC{xyz,u) 
and no odd wheel. 
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Theorem 9. A graph is odd-signable if and only if it contains no 3PC{x,y), 
no 3PC{xyz,uvw) and no even wheel. 

Proof of Theorem 8: In a 3PC{xyz,u) the sum of the weights of the three 
holes modulo 2 is equivalent to the weight of the edges of the triangle, since 
all other edges are counted precisely twice. So if a 3PC{xyz,u) is signed so 
that its holes have even weight, then the triangle also has even weight. Thus a 
3PC{xyz, u) is not odd-signable. 

Similarly in a wheel the sum of the weights of the chordless cycles containing 
the center is equivalent modulo 2 to the weight of the rim. In an odd wheel the 
number of triangles containing the center is odd and so if the wheel is signed so 
that the weights of the chordless cycles containing the center are correct then 
the weight of the rim is odd. 

Consider graphs (a) and (c) . By labeling 1 all edges in the triangles and 0 all 
other edges, we obtain a correct labeling of these graphs. In wheel {H,x), that 
is not odd label 1 all edges of H that belong to a triangle of {H, x) and 0 all 
other edges. □ 

Proof of Theorem 9: In a 3PC{x,y) the sum of the weights of two of the 
holes is equivalent modulo 2 to the weight of the third, since the edges in the 
intersection of the two are counted twice and the remainder induce the third. 
Thus if the graph is signed so that these two holes have odd weight then the 
weight of the third is even. So a 3PC{x, y) is not odd-signable. 

Similarly in a 3PC{xyz, uvw) the sum of the weights of all three holes is 
equivalent modulo 2 to the sum of the weights of the two triangles. If the graph 
is signed so that the weight of the three holes is odd then at least one of the 
triangles must have even weight. 

In an even wheel the weight of the rim is equivalent modulo 2 to the sum 
of the weights of the other chordless cycles. Since there are an even number of 
these, each with odd weight, the rim has even weight. 

Consider a graph of type (b). By labeling 1 all edges in triangles and 0 all 
other edges, we obtain an odd signing of these graphs. To label a wheel {H,x) 
that is not even, on every subpath of H with endnodes adjacent to x and no 
intermediate node adjacent to x, label 1 one edge and label 0 all other edges of 
{H, x). This gives an odd signing of the wheel. □ 

The recognition problem for both even-signable and odd-signable graphs is 
still open. In [4] both problems are solved for graphs that do not contain a cap 
as induced subgraph. (A cap is a hole H plus a node that has two neighbors in 
H and these neighbors are adjacent). 

In [3] a polynomial time recognition algorithm is given, to test if a graph G 
contains no even hole (i.e. G is odd-signable with all labels equal to one). 

3 Universally Signable Graphs 

Let G be a graph which is /3-balanced for all 0, 1 vectors /? that have an entry of 
1 corresponding to the triangles of G. Such a graph we call universally signable. 
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Clearly triangulated graphs i.e. graphs that do not contain a hole are universally 
signable. In [5] these graphs are shown to generalize many of the structural prop- 
erties of triangulated graphs. Here we show a decomposition theorem that follows 
easily from the co-NP characterization of these graphs as given by Theorem 1. 

Theorem 10. A graph G is universally signable if and only if G contains no 
graph of type (a), (b), (c) or (d) which is distinct from K 4 . 

In view of the previous remark, the above condition is equivalent to the 
condition: ”no hole of G belongs to a graph of type (a), (b), (c) or (d)”. Now 
the proof of the above theorem follows from Theorem 1 . 

As a consequence of Theorem 10 and Lemma 6 we have the following decom- 
position theorem. 

Theorem 11. A connected universally signable graph that is not a hole and is 
not a triangulated graph contains a K\ or K 2 cutset. 

It was the above decomposition theorem that prompted us to look for a new 
proof for Theorem 1 . 

Now Theorem 11 and the following result of Hajnal and Suranyi [11] can be 
used to decompose with clique cutsets a universally signable graph into holes 
and cliques. 

Theorem 12. A triangulated graph that is not a clique contains a clique cutset. 



4 CK-Balanced Graphs, Regular and Balanceable Matrices 

Actually, Truemper proved the following theorem that he also showed to be 
equivalent to Theorem 1. 

Let a be a vector with entries in {0, 1, 2, 3} indexed by the chordless cycles of 
a graph G. A graph G = (P, E) is a-balanceable if its edges can be labeled with 
labels of —1 and -1-1 so that for every chordless cycle C of G, 1{C) = ac mod 4. 
Such a signing is an a-balancing of G. 

Theorem 13. A graph is a-balanceable if and only if ac is even for all even 
length chordless cycles G and odd otherwise and every subgraph H of G of type 
(a), (b), (c) or (d) is a^ -balanceable. 

To see that the two theorems are equivalent note that an a-balancing of G 
with labels of 1 and —1, is implied by a /3-balancing with (3 = mod 2, 

by replacing the O’s by — I’s. Similarly the /3-balancing of G with labels of 0 and 
1 is implied by an a-balancing with ac = (2/3c -k |if(C')|) mod 4, by replacing 
the —I’s by O’s. 
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Balanceable and Balanced Matrices 

The bipartite graph G{A) of a matrix A has the row and column sets of A as 
color classes and for all entries ^ 0, G{A) has an edge ij of label atj. 

A 0, ±1 matrix A is balanced if G{A) is a-balanced for the vector a of all 
zeroes. A 0, 1 matrix A is balanceable if G{A) is a-balanceable for the vector a 
of all zeroes. (From now on, signing consists of replacing some of the I's with 
-I's). 

Note that the same signing algorithm of Section 1, applied to G(A), can be 
used to obtain a balanced matrix from A, when A is balanceable. Here signing 
the edges of G{A) means assigning labels ±1. 

We can now derive from Theorem 13 a co-NP characterization of balanceable 
matrices: 

Theorem 14. A 0, 1 matrix A is balanceable if G{ A) contains no 3PG{x,y) 
where x and y belong to opposite sides of the bipartition and no wheel, where the 
center node has odd degree. 

Proof: By Theorem 13 we only need to find in G{A) the subgraphs of type (a), 
(b), (c) or (d) that are not balanceable. Since G{A) is bipartite it cannot contain 
graphs of type (b) or (c). Graphs of type (a) with both endnodes in the same 
side of the bipartition are seen to be balanceable by signing the edges so that the 
three paths have the same length mod 4. When the two nodes of degree 3 belong 
to opposite sides of the bipartition then since two of the paths have the same 
length mod4, either 1 mod 4 or 3 mod 4 there exists a chordless cycle signed 
incorrectly with respect to a of all zeroes. 

For a wheel {H, x), let Ci, . . . ,Gk be the chordless cycles of {H, x) containing 
X. Obtain a signing of the graph so that Ci,. . . ,Gk are signed correctly. For F C 
E, let 1{F) = EeGF Ke)- Then KG^) = 0 mod 4. But 1{H) = KCi) ~ 
21{S) where S consists of all edges with one endpoint the center node of the 
wheel. Since 21{S) = 2|S'| mod 4, clearly 1{H) = 0 mod 4 if and only if fc = jS”! 
is even. □ 

In [8] , [2] , a polynomial algorithm is given, to recognize if a matrix is balance- 
able or balanced. Balanced 0, ±1 matrices have interesting polyhedral properties 
and have been recently the subject of several investigations, see [7] for a survey. 



Totally Unimodular and Regular Matrices: A Theorem of Tutte 

A matrix A is totally unimodular (TU, for short) if every square submatrix of 
A has determinant 0,±1. Consequently a TU matrix is a 0,±1 matrix. If A 
is a 0, ±1 matrix such that G{A) is a chordless cycle G, then det(A) = 0 if 
1{G) = 0 mod 4 and det{A) = ±2 if 1{G) = 2 mod 4. So if A is TU, then A is 
balanced. 

A 0, 1 matrix A is regular if A can be signed to be TU. An example of a 0, 1 
matrix that is not regular is one whose bipartite graph is a wheel with a rim of 
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length 6 (and the center node has obviously three neighbors in the rim). We will 
see this later in this section. 

To state the theorem of Tutte characterizing regular matrices, we need to 
introduce the notion of pivoting in a matrix. Pivoting on an entry e yf 0 of a 



matrix A = 


r T 1 
e y^ 


, we obtain the matrix B = 


r T 1 

-€ 

T~\ T 




X D 




X D — exy 



Remark 15. Let B be obtained from A by pivoting on the nonzero entry aij . 
Then: 



— A can be obtained from B by pivoting on the same entry. 

— Let Qij be the pivot element. Then bij = —aij. For I yf j, bn = an and for 
ky^i, bkj = akj. For I yf j and ky^i, bki = au - atjanakj 

— det{A) = ±det{D — exy"’") and det{B) = ±det{D). 

We are interested in performing the pivot operations on A both over the reals 
(i?-pivoting) and over GF2 (GJ^2-pi voting). Let i? be a matrix obtained from 
A by performing a GF2-pivot or an i?-pivot. We next show how to obtain G{B) 
from G{A). 

Remark 16. Let B be the 0, 1 matrix obtained from a 0, 1 matrix A by GF2- 
pivoting on aij = 1. Then G{B) is obtained from G{A) as follows: 

1) For every 4-cycle C = of G{A) remove edge uivi. 

2) For every induced chordless path P = ui,i,jj'Ci of G{A), add edge uivi. 



Proof: Follows from Remark 15. 



□ 



It is easy to check that a 2 x 2, ±1 matrix is singular if and only if the sum of 
its entries is equivalent to 0 mod 4. A 0, ±1 matrix A is weakly balanced if every 
4-cycle G of G{A) satisfies 1{C) = 0 mod 4. Equivalently, A is weakly balanced 
if every 2x2 submatrix of A has determinant 0,±1. 

Remark 17. Let B be the matrix obtained from a weakly balanced 0, ±1 matrix 
A by R-pivoting on a non-zero entry aij = e. Then B is a 0,±1 matrix and 
G{B) is obtained from G{A) as follows: 

1) Edge ij has label —e. 

2) For every 4~cycle u\,i,j,vi in G{A) remove edge uivi. 

3) For every induced chordless path P = ui,i,j,vi in G{A) add edge uiVi 
and label it so that, for the resulting cycle G = ui,i,j,vi in G{B), 1{G) = 
0 mod 4. 



Proof: 1) is trivial. By Remark 15, for k ^ i and I yf j, bu = aki — eakjan. 
Note that = 1 . So ebki is the value of the determinant of the 2x2 submatrix 
of A with rows i,k and columns j,l. Since A is weakly balanced, ebki and bki, 
have values in 0,±1. For 2), note that all 2 x 2 submatrices of A with all four 
entries non-zero have determinant 0. Finally since the 2x2 submatrix of B has 
determinant 0, part 3) follows. □ 
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Corollary 18. Let A he a weakly balanced 0, ±1 matrix and A be the 0, 1 matrix 
with the same support. Let B and B be the matrices obtained by R-pivoting A 
and GF2-pivoting A on the same entry. Then G{B) and G{B) have the same 
edge set. (Equivalently, B and B have the same support). 

Tutte [16], [17] proves the following: 

Theorem 19. A 0, 1 matrix A is regular if and only if for no matrix B, obtained 
from A by GF2-pivoting, G{B) contains a wheel whose rim has length 6. 

To prove the above result, we need the following three lemmas (the first is 
well known): 

Lemma 20. A 0, 1 matrix A is regular if and only if any matrix obtained from 
A by GF2-pivoting is regular. 

Proof: Follows by Remark 15 and Corollary 18. □ 



Lemma 21. Let A he a balanced 0, ±1 matrix, B be obtained by R-pivoting A 
on dij and B be the 0, 1 matrix with the same support as B. Lf B is not balanced, 
then B is not balanceable. 

Proof: We show that G{B) can be obtained from G{B) by applying the signing 
algorithm. Let T be any tree in G{B), chosen to contain all edges in {ij} U 
{ix : X G N{i)} U {jy : y G N{j)}. Then T is also a tree of G(A). Let 

S = t\,. . . , t|T|-i) ei, . . . , e/ be a consistent ordering of the edges of G{B), where 
ti are edges in T. We show that the signing of G{B) can be obtained by the 
signing algorithm with sequence S, where the edges of T are labeled as in G{B). 
Let Cfe be an edge of S and Ce^ be a chordless cycle of G{B) containing and 
edges in S' \ Ck+i, ■ . ■ ,em, such that Ge^, has the largest possible intersection 
with {i,j} and, subject to this, Ge*. is shortest. We show that Ge^ forces to 
be signed as in G{B). 

Remark 17 shows that if Ge^ contains both nodes i and j and has length 4, 
then Ggfc forces Cfc to be signed as in G{B). 

All other edges Cfc are labeled the same in G{B) and G{A). We show that 
Gej, forces this signing of edge Cfc. 

If Gej, contains both nodes i and j and has length bigger than 4, then in 
G{A) the nodes of Ge^ induce a cycle with unique chord ziji, where i\ and ji 
are the neighbors of i and j in Gg^. By Remark 17, the sum of the labels on 
the edges iii,ij,jji in G{B) is equivalent modulo 4 to the label of edge iiji, in 
G{A). Thus the cycle G(^ of G{A) induced by V{Ge^0 \ {i,j} and the cycle Gg^ 
of G{B) force Ck to be signed the same. 

If Ggj, contains one of {i,j}, say i, then by choice of Gg^., node j has i as 
unique neighbor in Ggj, . For, if not, either belongs to a chordless cycle of G(R) 
of to a chordless cycle that is shorter that Ggj, and contains node j (this happens 
when (Gg^ ,j) is the rim of a wheel with center j and no hole of (Ggj, , j) contains 
i, j and Cfe), a contradiction to our assumption. 
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But then Ce^ is also a chordless cycle of G{A) and forces Cfc to be signed as 
in G{B). 

If Ge^ contains neither i nor j and at most one neighbor of i or j then Ge^ is 
also a chordless cycle of G{A) and forces to be signed as in G{B). Otherwise 
by the choice of Cg*. , node i has a unique neighbor i' in , node j has a unique 
neighbor j' in Cgj, and i', j' are adjacent. So, by Remark 17, G{A) contains a 
hole Cgj,, whose node set is V{Ge^) U This hole and the hole Cg^ of 

G{B) force Cfc to be signed the same. □ 



Lemma 22. From every 0, 1 matrix A that is not regular, we can obtain o 0, 1 
matrix that is not halanceahle by a sequence of GF2-pivots. 

Proof: Let A be the smallest 0, 1 matrix (in terms of the sum of the number of 
rows and columns) that is not regular but cannot be pivoted to a matrix that 
is not balanceable. Since A is obviously balanceable, let A be a corresponding 
balanced 0, ±1 matrix. By minimality, we can assume that A is square and 
\det{A)\ > 2. By Remark 15, we can i?-pivot on any nonzero entry of A to 
obtain a 0,±1 matrix B which contains a proper submatrix G with the same 
determinant value as A. Since A is weakly balanced, by Remark 17, B, and 
hence C, is a 0, ±1 matrix. Let B be the 0, 1 matrix with the same support as B 
and C the submatrix of B corresponding to C. By Corollary 18, B is obtained 
from A with a GF2-pivot on the same element. Assume B is balanced. Then C 
would be a balanced matrix which is not TU. However, this implies that G is 
not regular (this was already known to Camion [1]): Indeed, G is a signing of G 
which is balanced but not TU: So G can be obtained by applying the signing 
algorithm on G(G), starting with a tree T of G(G). Assume G has a TU signing 
G'. Since C' is also a balanced matrix, then G(G') can be obtained through the 
signing algorithm by signing T as in G(G'). So G(G) and G(G') differ on some 
fundamental cuts of T . So G can be transformed in G' by multiplying by —1 the 
rows and columns corresponding to the nodes in on shore of this cut. However 
this operation preserves the TU property. 

So B is not balanced and by Lemma 21, B is not balanceable. □ 

Proof of Theorem 19: By Lemma 20, regular matrices are closed under GF2- 
pivoting and if A is a 0, 1 matrix such that G{A) contains a wheel G(W) whose 
rim has length 6, then W (hence A) is obviously not regular. 

For the sufficiency part, if A is a 0, 1 matrix which is not regular, then by 
Lemma 22, we can obtain by GU2-pivots a 0, 1 matrix B which is not bal- 
anceable. By Theorem 14, G{B) contains a 3PG{x,y) where x and y belong to 
distinct color classes, or a wheel with rim FI and center v and v has an odd 
number, greater than one, of neighbors in FI . 

If G{B) contains a 3PG{x,y), Remark 16 shows that we can GU2-pivot on 
B so that all of its paths have length three and by doing a last GU2-pivot on an 
entry corresponding to an edge incident to x, we obtain a wheel whose rim has 
length 6. 
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If G{B) contains a wheel {H, x) and x has an odd number of neighbors in 
the rim H , Remark 16 shows that we can G_F2-pivot on an entry corresponding 
to an edge of H, incident with a neighbor of x, to obtain a wheel {H' , x), where 
X has two less neighbors in H' than in JI. When x has only three neighbors in 
H', to obtain a wheel whose outer cycle has length 6, GF2-pivot so that all the 
subpaths of H', between two consecutive neighbors of x have length two. □ 

Tutte’s original proof of the above theorem is quite difficult. A short, self- 
contained proof can be found in [10]. In [12], a decomposition theorem for regular 
matrices in given, together with a polynomial algorithm to test if a matrix is 
regular or TU. A faster algorithm is given in [15]. 

5 Decomposition 

The co-NP characterizations obtained in Theorems 8, 9 and 14 are used in [3], 
[4], [8], [2] to obtain the decomposition results for graphs without even holes, 
cap- free graphs and balanceable matrices. However the proofs of these theorems 
are long and technical. We have seen how Theorem 1 can be used to decompose 
universally signable graphs with Ki and K 2 cutsets into holes and triangulated 
graphs. Here we further illustrate in two easy cases the use of a co-NP character- 
ization to obtain decomposition results and polynomial recognition algorithms. 



Restricted Unimodular and Totally Odd Matrices 

A 0, ±1 matrix A is restricted unimodular (RU, for short) if every cycle C (possi- 
bly with chords) of G{A) satisfies 1{G) = 0 mod 4. A 0, 1 matrix A is signable to 
be RUii there exists a RU 0, ±1 matrix that has the same support. RU matrices 
are a known subclass of TU matrices, see e.g. [18]. 

A 0,±1 matrix A is totally odd (TO, for short) if every cycle G of G{A) 
satisfies 1{G) = 2 mod 4. A 0, 1 matrix A is signable to be TO if there exists a 
TO 0, ±1 matrix that has the same support. TO matrices are studied in [9]. 

A weak 3-path configuration between nodes x and y (W3PG{x,y)) is made 
up by three paths Pi, P 2 , P 3 connecting x and y such that Pi LI Pj, i yf j, 
i,j = 1,2,3 induces a cycle (possibly with chords). So Pi may be a single edge 
or may contain chords and edges may have endnodes in distinct paths Pi and 
Pj. If G is a bipartite graph, a WiPG{x,y) is homogeneous is x and y belong 
to the same color class of G and is heterogeneous otherwise. 

Theorem 23. A 0, 1 matrix A is signable to be RU if and only if G{A) contains 
no weak 3-path configuration which is heterogeneous and A is signable to be TO 
if and only if G{A) contains no weak 3-path configuration which is homogeneous. 

Proof: Let G' be the bipartite graph obtained from G{A) by replacing each edge 
with a path of length 3 and A! be the 0, 1 matrix such that G' = G(A'). Then 
there is a correspondence between the cycles of G(A) and the holes of G' . So 
A is signable to be RU if and only if G' is a-balanceable for the vector a of all 
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zeroes (i.e. A! is a balanceable matrix) and A is signable to be TO if and only 
if G is a-balanceable for the vector a of all twos. In the first case, the theorem 
follows from Theorem 14 . The proof for the second case in analogous and is left 
as an exercise. □ 

A bridge of a cycle C is either a chord of C or a subgraph of G, whose node set 
contains all nodes of a connected component of G \ K (G), say G , together with 
the nodes of G, adjacent to at least one node in G and whose edges are the edges 
of G with at least one endnode in G' . The attachments of a bridge B are the 
nodes of V{B) n V{G). A bridge is homogeneous if all of its attachments belong 
to the same color class of G and is heterogeneous if no two of its attachments 
belong to the same color class of G. Obviously, if B is a heterogeneous bridge, 
then B has at most two attachments. 

Lemma 24 . Let A &e 0, 1 matrix that is signable to be RU and G any cycle of 
G{A). Then every bridge of G is homogeneous. 

Let A be 0,1 matrix that is signable to be TO and G any cycle ofG(A). Then 
every bridge of G is heterogeneous. 

Proof: We prove the first statement. Let x and y be two attachments of a bridge 
B of a cycle G. If x and y belong to distinct color classes of G{A), then G{A) 
contains a heterogeneous W 3 PG{x,y) where Pi, P2 are the two xy-subpaths of 
G and P3 is any xy-path in B. The proof of the second statement is similar. □ 

Bridges B\ and B2 of G cross if there exist attachments x\, y\ of B\ and X2, 
7/2 of B2 that are distinct and appear in the order x\,X2,yi,y2 when traversing 
G in one direction. 

Lemma 25 . Let A 6e 0, 1 matrix that is signable to be RU and G any cycle of 
G{A). Then no pair of homogeneous bridges ofG, having attachments in distinct 
color classes ofG(A), cross. 

Let A be 0,1 matrix that is signable to be TO, G any cycle ofG(A). Then 
no pair of heterogeneous bridges of G cross. 

Proof: To prove the first statement, assume Bi and B2 are homogeneous bridges 
of G having attachments x\, y\ of B\ and X2, 2/2 of B2, appearing in the order 
x\,X2,yi,y2 when traversing G. If x\,yi and X2,y2 are in distinct color classes 
of G{A), we have a heterogeneous W3PG{x\,X2), where P\ is the subpath of 
G, connecting x\, X2 and not containing y\. P2 and P3 contain respectively a 
xi, 7 /i-path in B\ and a X2, 7/2-path in B2. The proof of the second part is similar. 
□ 



Theorem 26 . Let A 6e 0, 1 matrix that is signable to be TO, G any cycle of 
G{A) and B be a heterogeneous bridge of C with two attachments x and y. Then 
G{A) \ {x,y} is disconnected. 
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Proof: By Lemma 24 , x and y are the only two attachments of B. Let Pi, P2 be 
the two subpaths of C, connecting x and y. By Lemma 25 , no bridge of C has 
an attachments in both Pi \ {x,y} and P2 \ {x,y}. So no two of B, Pi and P2 
are in the same component of G{A) \ {x, y} and, since at least two of them are 
not edges, G{A) \ {x,y} contains at least two components. □ 

Theorem 27 ([ 18 ]). Let A be 0,1 matrix that is signable to be RU and G 
any cycle of G{A) containing homogeneous bridges Bi and B2 with attachments 
in distinct color classes of G{A). Then G contains two edges whose removal 
disconnects G{A) and separates Bi and i?2- 

Proof: Assume that the attachments of Bi and B2 belong to the ’’red” and 
’’blue” sides of the bipartition of G{A). Let Pi be be minimal subpath of G with 
the following property: 

Pi contains all the attachments of Bi and no bridge ofG with red attachments 
has all its attachments either in Pi or outside Pi 

The subpath P2 is similarly defined, with respect to B2 and the bridges with 
blue attachments. By Lemma 25 Pi and P2 can be chosen to be nonoverlapping. 
Furthermore by minimality of Pi and P2, the endnodes oi, bi of Pi are red nodes 
and the endnodes 02, 62 of P2 are blue nodes. Let C = oi. Pi, 61, Pb^bsi ^27 P27 027 
Pa2ai, let b be any edge of Pbib^ and a any edge of Paia^ - By Lemma 25 and the 
construction of Pi and P2, Pi U Bi and P2 U P2 belong to distinct components 
of G \ {a, b}. □ 

Clearly to test if A is signable to be RU, we can assume that G{A) is bicon- 
nected, otherwise we work on the biconnected components. 

If G{A) is biconnected and contains no cycle with homogeneous bridges with 
attachments in distinct color classes of G{A), then A has two ones per row or per 
column. (This is easy from network flows). In this case A is obviously RU: Sign A 
so that each row or column contains a 1 and a —1 to obtain a network matrix (or 
its transpose). From this fact and the above theorem yield in a straightforward 
way a polytime algorithm to test if a 0, 1 is signable to be RU. This algorithm, 
combined with the signing algorithm of Section 1 , gives a procedure to test if a 
0,±1 matrix is RU. 

In a similar manner, see [ 9 ], Theorem 26 and the signing algorithm give 
procedures to test if a 0, 1 matrix is signable to be TO and to test if a 0 , ±1 
matrix is TO. 
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Abstract. Bidirected graphs are a generalization of undirected graphs. 
The generalized stable set problem is an extension of the maximum 
weight stable set problem for undirected graphs to bidirected graphs. 
It is known that the latter problem is polynomially solvable for claw-free 
undirected graphs. In this paper, we define claw-free bidirected graphs 
and show that the generalized stable set problem is also polynomially 
solvable for claw- free bidirected graphs. 



1 Introduction 

Let G = (V, E) be an undirected graph. A subset S' of U is called a stable 
set if any two elements of S are nonadjacent. Given a weight vector w G 3?^, 
a maximum weight stable set is a stable set S maximizing w{S) = 

The problem of finding a maximum weight stable set is called the maximum 
weight stable set problem (MWSSP). It is well known that the problem can be 
formulated as the following integer programming problem: 

[AIWSSP] maximize w ■ x subject to Xi + Xj <l for (z, j) G E, 

Xi G {0, 1} for i gV. 

In this paper, we consider the problem generalized as follows: for a given finite 
set V and for given P,N,I CV x V, 

[GSSP] maximize w ■ x subject to Xi + Xj < 1 for (z, j) G P, 

—Xi — Xj < —1 for (z, j) G N, 

Xi — Xj < 0 for (z, j) G I, 

Xi G {0, 1} for i gV. 

Here we call this problem the generalized stable set problem (GSSP). We note 
that the GSSP is equivalent to the generalized set packing problem discussed in 
[1,2]. To deal with the GSSP, a ‘bidirected’ graph is useful. A bidirected graph 
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G = (V,E) has a set of vertices V and a set of edges E, in which each edge 
e G E has two vertices i,j G V as its endpoints and two associated signs (plus 
or minus) at i and j. The edges are classified into three types: the (+,+)-edges 
with two plus signs at their endpoints, the )-edges with two minus signs, 
and the (+, — )-edges (and the (— ,+)-edges) with one plus and one minus sign. 
Given an instance of the GSSP, we obtain a bidirected graph by making (+, +)- 
edges, (— , —(-edges and (-I-, —(-edges for vertex-pairs of P, N and I respectively. 
Gonversely, for a given bidirected graph with a weight vector on the vertices, by 
associating a variable Xi with each vertex, we may consider the GSSP. We call 
a 0—1-vector satisfying the inequality system arising from a bidirected graph G 
a solution of G. We also call a subset of vertices a solution of G if its incidence 
vector is a solution of G. The GSSP is an optimization problem over the solutions 
of a bidirected graph. 

Since several distinct bidirected graphs may have the same set of solutions, 
we deal with some kind of ‘standard’ bidirected graphs. A bidirected graph is 
said to be transitive, if whenever there are edges e\ = (i,j) and 62 = (j, k) with 
opposite signs at j, then there is also an edge 63 = {i, k) whose signs at i and k 
agree with those of ei and 62 - Obviously, any bidirected graph and its transitive 
closure have the same solutions. A bidirected graph is said to be simple if it has 
no loop and if it has at most one edge for each pair of distinct vertices. Johnson 
and Padberg [3] showed that any transitive bidirected graph can be reduced to 
simple one without essentially changing the set of solutions, or determined to 
have no solution. We note that a transitive bidirected graph has no solution if 
and only if it has a vertex with both a (j-, -l-)-loop and a (— , —(-loop. For any 
bidirected graph, the associated simple and transitive bidirected graph can be 
constructed in time polynomial in the number of vertices. 

Given a bidirected graph G, its underlying graph, denoted by G, is defined as 
the undirected graph obtained from G by changing all the edges to (j-, -l-)-edges. 
A bidirected graph is said to be claw-free if it is simple and transitive and if its 
underlying graph is claw-free (i.e., does not contain a vertex-induced subgraph 
which is isomorphic to the complete bipartite graph 

It is well known that the MWSSP is NP-hard for general undirected graphs 
(and hence, the GSSP is also NP-hard). However, for several classes of undirected 
graphs, the MWSSP is polynomially solvable. For example. Minty [4] proposed a 
polynomial time algorithm for the MWSSP for claw- free undirected graphs. On 
the other hand, there are several polynomial transformations from the GSSP to 
the MWSSP (see [5,6]). Unfortunately, we cannot easily derive the polynomial 
solvability of the GSSP for claw-free bidirected graphs by using these transfor- 
mations, because these do not preserve claw- freeness. Our aim in this paper is 
to verify that the GSSP for claw-free bidirected graphs is polynomially solvable. 

2 Canonical Bidirected Graphs and Their Solutions 

In this section, we will give several definitions and discuss basic properties of 
solutions of bidirected graphs. Let G = (U, E) be a simple and transitive bidi- 
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reeled graph and w be a weight vector on V. For any subset U C V, we call 
the transformation which reverse the signs of the u side of all edges incident to 
each u G U the reflection of G at U, and we denote it by G:ll. Obviously, reflec- 
tion preserves simpleness and transitivity. Let w:U denote the vector defined by 
{w:U)i = —Wi if i G [/; otherwise {w:U)i = Wi. For two subsets X and Y of V, 
let X A y denote the symmetric difference of X and Y. 

Lemma 1. Let X be any solution of G. Then, X AU is a solution of G\U . The 
GSSP for (G,w) is equivalent to the GSSP for {G:U,w:U). 

Proof. The first assertion is trivial from the definition of G:ll. The second as- 
sertion follows from the equation w:U{X AU) = ~ 

~ (the last term is a constant). □ 

We say that a vertex is positive (or negative) if all edges incident have plus 
(or minus) signs at it, and that a vertex is mixed if it is neither positive nor 
negative. If a bidirected graph has no (— , — )-edge, it is said to be pure. We say 
that a bidirected graph is canonical if it is simple, transitive and pure and it has 
no negative vertex. For any instance (G, w) of the GSSP, we can transform it to 
equivalent one whose bidirected graph is canonical as follows. From the previous 
section, we can assume that G is simple and transitive. Johnson and Padberg [3] 
proved that G has at least one solution U C V. From Lemma 1, G:U has the 
solution U AU = 0, that is, G:U must be pure. Let W be the set of negative 
vertices of G:U. Then G:U:W has no negative vertex, and furthermore, it is pure 
because any edge (v,w) of G:U with w G W must be a (-I-, — )-edge. Since this 
transformation is done in polynomial time, we assume that a given bidirected 
graph of the GSSP is canonical in the sequel. 

For any solution X of a, canonical bidirected graph G, we partition X into 
two parts: 

Xb = {iG X \ Nfi+{i)nX = il}} and Xi = {i G X \ NQ + {f) n X ib}, 

where NQ~^{i) denotes the set of vertices adjacent to i by a (— , -b)-edge incident 
to i with a minus sign, Ag“(z) is deflned analogously. Here we call Xb a base 
of X. Let 



bx{Xb) = Xb {i GV \ i G Nq (x) for some x G Xb}. 

If S' C y is a stable set of G, we say that S is a stable set of G. It is not difficult 
to show the following lemmas. 

Lemma 2. For any solution X of a canonical bidirected graph G, X = bx{Xb), 
and hence, {ex{XB))B = Xb. 



Lemma 3. For any solution X of a canonical bidirected graph G, its base Xb 
is a stable set of G. 
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Lemma 4. For any stable set S of a canonical bidirected graph G, ex(S') is a 
solution of G. 

Thus there is a one-to-one correspondence between the solutions and the 
stable sets of G. 

For any subset U of V, let G[U] denote the subgraph induced by U. We call 
H CV a, connected component of G if H induces a connected component of G. 

Lemma 5. Let X and Y be solutions of a canonical bidirected graph G. For 
any connected component FI of G[Xb A Yb], Xb Ls FI and Yb A F[ are bases of 
certain solutions of G. 

Proof. From Lemma 3, Xb and Yb are stable sets of G. Thus XbL\F[ and 
Yb A H are also stable sets of G. Hence Lemma 4 implies the assertion. □ 

Let AT be a specified solution of G. For any solution Y of G, let iLi, . . . , F[g be 
the connected components of G[Xb A Yb\. We define the weight of Fti, denoted 
by 5^{Hi) or simply 5{Hi), by 

5^ {Hi) = w{e^{XB A Hi)) - w{X). 

We remark that the equation w{Y) — w{X) = ^ 6{Hi) may not hold because 
there may exist a vertex v such that Nq~^{v) contains several vertices of distinct 
connected components, that is, may be doubly counted. In order to avoid 
this obstacle, we require some additional conditions. 

Lemma 6. For any solution X of G, there exists U CV such that G' = G:U 
and X' = X AU satisfy 

(a) G' is canonical, 

(b) X' is a stable set of G' , i.e., X' = {X')b, 

(c) for each mixed vertex v ^ X' , there is a vertex u & X' adjacent to v. 

Proof. Let M be the set of all mixed vertices v such that v ^ X, v is adjacent 
to no vertex of Xb and Nq~{v) = 0. For any (inclusion-wise) maximal stable 
set S of G[M], U = Xj LI S satisfies the assertion. □ 

We note that a subset U having the conditions of Lemma 6 can be found in 
polynomial time. The conditions of Lemma 6 overcome the above obstacle. 

Lemma 7. Let G be a canonical bidirected graph and X be a solution of G 
satisfying the conditions of Lemma 6. For any solution Y , let Hi, ... , H( be the 
connected components of G[Xb AYb]. Then, 

t i 

w{Y)-w{X) = = y^{w(ex(X^ A Hj)) - w(X)}. 

i=l 

Proof. Suppose to the contrary that there exists a mixed vertex v such that 
Nq~^{v) contains two vertices u and w of distinct connected components Hi and 
Hj. Since AT is a stable set, u,w G Yb. Let a; be a vertex of X adjacent to v. 
From the transitivity, x must be adjacent to both u and w. This contradicts the 
fact that Hi and Hj are distinct connected components of G[Xb A Yb]. □ 
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3 A Basic Idea for Finding an Optimal Solution of the 
GSSP 

Given an instance (G, w) of the GSSP, for each i = 0, 1, . . . , |P |, let 

5i = {XCP|Xisa solution of G and has exactly i positive vertices }, 

w* = max w(X), 
xeSi ^ ’ 

S* = {X G 5, I w{X) = u>*}. 

Suppose that N denotes the smallest number j with = max, w* . Minty [4] 
showed that if a given undirected graph is claw- free, then < ■ ■ ■ < . More 

precisely, (0, . . . , {N, w^) lie on an increasing concave curve. Minty’s algo- 

rithm for solving the MWSSP for claw-free undirected graphs finds an optimal 
solution by tracing (i,w'^) one by one. However, even if a given bidirected graph 
is claw-free, this fact does not hold as an example in Figure 1 where (-I-, +)- 
edges are drawn by lines and (-I-, — )-edges by arrows whose heads mean minus 
signs. Thus, it seems to be difficult to trace one by one for the GSSP. 



3 5 4 





Fig. 2. 
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We will use a technique of the fractional programming. Let us consider the up- 
per envelope of the convex hull of the set of pairs (0, (1, , {N, w^) 

as in Figure 2. We call a Pareto -optimal pair if it lies on the envelope, 

and their solutions Pareto -optimal solutions. Obviously, (0, w°) and {N,w^) are 
always Pareto-optimal. In Figure 2, (0,ru°), (l,w^), (3,w^), (4, (6,w®) and 

(N,w^) are Pareto-optimal. 

Let AT* be a Pareto-optimal solution with X'' G Si. Suppose that IF is a subset 
of all the solutions of G such that X* € IF and T is defined independently to 
the weight vector w. Let us also consider the Pareto-optimal solutions for the 
restriction on T . Obviously, X* is also Pareto-optimal in T . We consider the 
following two problems 

[MAX5] max {<5(P) = w{Y) — , and 

[MAXp] max|p(y) = | <5(r) > o| , 



where v{Y) denotes the difference of the numbers of all the positive vertices of 
Y and XL We denote p(-) and 6{-) for a weight vector w by Pw{-) and (5u)(-) 
explicitly. Suppose that X* is not optimal in T . Let Y^ be an optimal solution 
of the MAX5 for uP = w. We set r = pyjo{Y^) and consider the new weight 
vector defined by 



w} 



— r if z is a positive vertex, 
otherwise. 



( 1 ) 



Then, = 0. For any solution Y € IF, 



Pw^{Y) 



5 wo{Y) - r ■ iy{Y) 

HY) 



Pw°{Y) - r. 



Thus, X* is Pareto-optimal in T for . We now assume that there is a so- 
lution Y* with pnjo(Y*) > pu)o(y^) and <5^)0 (X*) > 0. Then, evidently, 0 < 
v{Y*) < jy{Y^). We also have (5u}i(X*) = — r-zz(X*)] = z/(X*)[pu,o(X*) — 

Pii,o{Y^)] > 0. Conversely, if <5^)1 (X*) > 0 then pu,o{Y*)>pij,o{Y^) and 6njo(Y*) > 
0. Summing up the above discussion, for an optimal solution X^ of the MAX(5 
for w^, if (5^)1 (X^) = 0 then X^ is an optimal solution of the MAXp for w; oth- 
erwise, by repeating the above process at most \V\ times, the MAXp for w can 
be solved, because of the fact that n{Y^) > zz(X^) > • • • > 0. 

From the above discussion, for each Pareto-optimal solution X* G if we 
can easily define a subset !F such that 

(Al) X* G IF and S* P T ^ tb where is the next Pareto-optimal pair, 

and 

(A2) the MAX(5 for IF and for any w can be solved in time polynomial in the 
number of vertices of G, 

then we can either determine X* is optimal or find a Pareto-optimal solution 
X^ G with z < A: < X in polynomial time. (We may find (4, zc^) from (1, w^) 
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in Figure 2.) In addition, if G Sq can be found in polynomial time, the GSSP 
for (G, w) can be solved in polynomial time. In fact, this initialization is not so 
difficult if we can apply the above technique for any vertex-induced subgraph of 
G, because it is sufficient to solve the GSSP for the bidirected graph obtained 
from the current one by deleting all the positive vertices, recursively. 

Finally we introduce a tool in order to trace Pareto-optimal pairs. Let 
be a Pareto-optimal solution with i < N. Without loss of generality, we assume 
that X* and G satisfy the conditions of Lemma 6. We say that iL C P is an 
alternating set for X* if H is connected in G and if X® A iL is a stable set of 
G. We define the weight 6{H) of an alternating set H with respect to w by 
w(ex(W AiL)) - w(W). 

Lemma 8. Let {j,w^) he the next Pareto-optimal pair of (i,w’‘). Then, for any 
X^ G S* , there exists a connected component H of G[Xg A X^] such that 
ex{Xg A H) is a Pareto -optimal solution with more positive vertices than X®. 

Proof. For each connected component PI , we denote by v{H) the difference of 
numbers of positive vertices of and X®niL. It is not difficult to show that 

either {5{P[) > 0 and Jz(iL) > 0) or (<5(iL) = 0 and v{P[) = 0) or {5{P[) < 0 and 
v{H) < 0). Here we ignore the second case. Since i < j, G[{Xg A X^^)] must have 
at least one connected component of the first case. For each connected component 
H of the first or third case, let p{H) = 6{H)/v{H). From the minimality of j 
and the Pareto-optimality of if H is of the first case, then p(iL) < 

{w^ —w'‘)/{j—i). Similarly, if iL is of the third case, then p(iL) > {w^ — w^)/{j—i). 
By combining the above inequalities and Lemma 7, one can obtain p(i7) = 
for any PI of the first or third case. Hence, any connected component H of the 
first case satisfy the assertion. □ 

Lemma 8 says that we can trance Pareto-optimal solutions by using alternating 
sets. 

4 Finding a Next Pareto-Optimal Solution 

Let G, w and X he a, given claw-free bidirected graph, a given weight vector on 
the vertices and a Pareto-optimal solution with respect to w. Without loss of 
generality, we assume that G and X satisfy the conditions of Lemma 6. In this 
section, we explain how to find a next Pareto-optimal solution. 

We first give several definitions. We call the vertices of X black and the other 
vertices white. Any white vertex is adjacent to at most two black vertices, since 
otherwise G must have a claw. A white vertex is said to be bounded if it is 
adjacent to two black vertices, free if it is adjacent to exactly one black vertex 
and otherwise super free. A cycle (or path) is called an alternating cycle (or path) 
if white and black vertices appear alternately, and its white vertices form a stable 
set. An alternating path is called free if its endpoints are either black or free or 
super free. Alternating cycles and free alternating paths are alternating sets, 
and vice versa in claw-free cases. Thus, Lemma 8 guarantees that we deal with 
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only alternating cycles and free alternating paths in order to find a next Pareto- 
optimal solution. An alternating cycle or a free alternating path is called an 
augmenting cycle or an augmenting path respectively if it has a positive weight. 
For two distinct black vertices x and y, let W denote the set of all the bounded 
vertices adjacent to both x and y. If W is not empty, W is called a wing adjacent 
to X (and y). A black vertex is called regular if it is adjacent to three or more 
wings, irregular if it is adjacent to exactly two wings, and otherwise useless. 
An alternating cycle is said to be small if it has at most two regular vertices; 
otherwise large. Here we call Ci, . . . ,Ck a large augmenting cycle family if each 
Ci is a large augmenting cycle and each vertex in Ci is adjacent to no vertex in 
Cj for 1 < i < j < k. From Lemma 7, S{Ci U • • • U Cf) = i5(Ci) -I- • • • -I- S{Ck) 
holds. 

Our algorithm for finding a next Pareto-optimal solution is described by 
using the technique discussed in the previous section: 

(0) ^ w and i ^ 0 ; 

(1) Find a small augmenting cycle of the maximum weight for w* if it 

exists, otherwise go to (2) ; 

Construct the new weight by applying (1), i <— i -|- 1 and repeat (1) ; 

(2) Find a large augmenting cycle family Ai^i of the maximum weight for w‘‘ 
if it exists, otherwise go to (3) ; 

Construct the new weight by applying (1), i <— i -|- 1 and repeat (2) ; 

(3) Find an augmenting path of the maximum weight for w* if it exists, 
otherwise go to (4) ; 

Construct the new weight by applying (1), i <— t -|- 1 and repeat (3) ; 

(4) If i = 0 then X is optimal, otherwise ex(A A Ai) is a next Pareto-optimal 
solution. 

Note that in (2) there is no small augmenting cycle since these are eliminated 
in (1), and that in (3) there is no augmenting cycle since these are eliminated in 
(1) and (2). These facts are important in the following sense. 

Theorem 9. For any weight vector, 

1 . a maximum weight small augmenting cycle can he found in polynomial time, 

2. a maximum weight large augmenting cycle family can be found in polynomial 
time if no small augmenting cycle exists, 

3. a maximum weight augmenting path can he found in polynomial time if no 
augmenting cycle exists. 

By Lemma 8 and Theorem 9, our algorithm find a next Pareto-optimal solu- 
tion in polynomial time. Summing up the above discussions, we obtain our main 
theorem. 

Theorem 10. The GSSP for claw-free hidirected graphs is polynomially solv- 
able. 
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In the rest of the section, we briefly explain a proof of Theorem 9. Our ap- 
proach is an extension of Minty’s algorithm for undirected claw-free graphs. This, 
however, does not seem a straightforward extension because we must overcome 
several problems. A significant problem is how to deal with ‘induced weights’. 
Let A be an alternating cycle or a free alternating path. Then its weight is 
expressed as 

Sx(A) = w(A—X) — w{XnA) -1- I ^ is mixed, Nq~^{v) O {A—X) ^ 0}. 

We call the ^ term the induced weight, which appears in the bidirected case but 
not in the undirected case. 

We first consider cycles. Let xi,. . . ,Xk with A: > 3 be distinct black vertices 
and Wi,. . . , Wk, Wk+i = Wi be wings such that Xi is adjacent to Wi and Wi+i 
for i = 1, . . . , fc. Then {W\,x\,W 2 , ■ ■ ■ , Wk, Xk, Wi) is called a cycle of wings. It 
is easy to show the following: 

Lemma 11 ([4]). Let {Wi,xi,W 2 , ■ ■ ■ ,Wk,Xk,W\) with k > 3 be a eycle of 
wings and y^ G Wi for i = 1, . . . ,k. Then {yi,xi,y 2 , ■ ■ .,yk,Xk,yk+i = yi) is an 
alternating cycle if and only if yi is not adjacent to yi+\ for i = 1, . . . ,k. 

Lemma 12. Let v be a mixed vertex such that Nq~^{v) has a bounded vertex 
but is not included in a wing. Then there uniquely exists a black vertex x such 
that [x = v or X is adjacent to f] and all the vertices in Nq~^{v) are adjacent to 

X. 



Proof. It is trivial if v is black. Suppose that v is white. Let y be a bounded 
vertex in Nq~^(v), and let xi and X 2 be the black vertices adjacent to y. Since 
they are in X which is a stable set, xi is not adjacent to X 2 . Thus, without loss of 
generality, we can assume that v is adjacent to x\ since otherwise {y,v,xi,X 2 } 
induces a claw. The edge (v,xi) is not a (— ,-b)-edge because v is white and 
xi is black. That is, the sign of this edge at v is -b. Let y' be any vertex in 
Nq~^{v) — {y}. Then y' must be adjacent to x\ from the transitivity. Finally 
note that v is not adjacent to X 2 , since otherwise from the same discussion any 
vertex in Nq'^{v) — {y} must be adjacent to X 2 and Nq~^{v) is included in the 
wing adjacent to x\ and X 2 , a contradiction. □ 

Lemma 13. Let C = {W\,x\,W 2 , . . . ,Wk,Xk,Wi) be a cycle of wings (k > 
3). Then a maximum weight alternating cycle included in C can be found in 
polynomial time. 

Proof. Let Wk+i = Wi and Wq = Wk. For i = 1,. . . ,k and for each pair y G Wi 
and z G Wi+i such that y is not adjacent to z, draw a directed ‘red’ edge 
from y to z with weight w{y) — w{xi) + i® mixed, IV q n Wi-i = 

IIi,Nq~^{v) (^Wi^tb and [y G Nq'^{v) or z G From Lemma 11, there 

is a one-to-one mapping between all the directed cycles of red edges and all the 
alternating cycles in C. 
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Let f be a mixed vertex such that Nq~^{v) n {W\ U • • • U Wk) ^ 0. From 
Lemma 12, there uniquely exists i G such that Nq^{v) n Wi-i = 

0 and Nq~^(v) nWi^0. Moreover from Lemma 12 again, for such i, Nq~^(v) n 
{{W\ U • • • U Wk) — (Wi U Wi+i)) = 0. Hence the mapping conserves weights. A 
maximum weight directed cycle of red edges can be found in polynomial time 
by the breadth first search. □ 



Lemma 14. A maximum weight small augmenting cycle can he found in poly- 
nomial time. 

Proof. The number of alternating cycles of length 4 is polynomially bounded. 
Thus we can easily find one having the maximum weight. On the other hand, 
each alternating cycle of length at least 6 is included in a certain cycle of wings. 
The number of all cycles of wings containing at most two regular vertices is 
polynomially bounded. We can also enumerate these in polynomial time. By 
Lemma 13, we can find a maximum weight small augmenting cycle in polynomial 
time. □ 

Unfortunately, a maximum weight large augmenting cycle cannot be found 
in polynomial time in the same way because the number of the cycles of wings 
having three or more regular vertices cannot be polynomially bounded. Before 
considering the step (2) in our algorithm, we introduce a useful property relative 
to wings around regular vertices. For convenience, we will use some notations as 
below: 

• means that v\ and V 2 are adjacent, and V\^V 2 means v\ and V 2 are 
not adjacent. 

• U2 says there is an edge having plus and minus sings at v\ and V 2 

H — 

respectively, and vi 9^ V 2 is its negation. 

_|_ _| |_ _| 

• vi ~ U2 denotes either vi ~ V 2 or vi ~ U2, and vi 7^ V 2 is the negation of 

Wl ~ V2. 

• V 1 OV 2 says that vi and V 2 are contained in the same wing, and v\ f> V 2 is its 
negation. 



Lemma 15 ([4]). Given a regular vertex x, let B{x) = {v| and v is 

hounded}. Then there exists a partition of B{x), namely [N^(x), N'^(x)], such 
that for any vi,V 2 G B{x) with v\ f>V 2 , 

Ul~U2 [vi,V2 & N^{v) or Vi,V2 & N'^{v)]. 

Moreover this partition is uniquely determined, and hence, it can he found in 
polynomial time. 

This is the key lemma of Minty’s algorithm. If a large alternating cycle or a free 
alternating path passes through v\ G N^(y) and a regular vertex v, then it must 
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pass through a vertex V2 such that V2 G N'^ (v) and V2 />vi. From this property 
Minty showed that by constructing a graph called the “Edmonds’ graph” and by 
finding a maximum weight perfect matching of it, a maximum weight augmenting 
path for any Pareto-optimal stable set can be found in polynomial time. To 
deal with induced weights, we require an additional property of the partition of 
vertices adjacent to a regular vertex. 

Lemma 16. For a regular vertex x and a vertex v such that v = x or v^x, we 
define 

N^{x) N‘^{x) 3 aGN^{x), 3 bGN‘^{x) such that 09^6,0^ v and h ^ v, 

N‘^{x) N'^{x) 3 c&N'^ {x) , 3 d&N'^ { x) such that c^d, v and d v. 

Then at most one of N^{x) N'^{x) and N‘^{x) >~y N^{x) holds. 

H — _l_ 

Proof. Let us consider the case v = x. If b x, then b^x because b^x. In 
addition, if x, then a~^ b. Hence neither N^{x) N^{x) nor N^{x) 

N^(x) holds. 

Suppose to the contrary that V'^x, N^(x) N^{x) and N^{x) N^{x). 

There exist a,d G N^{x) and b,c G N'^{x) such that 09^6, C'/'d, v, b ^ v, 

_l H — 

c ~ V and d ^ V. Note that h, d and v are mutually distinct. Assume to the con- 

_l_ H _l _l_ _l_ 

trary that Then 6 ~ u because b v. But a ~ u and u ~ 6 induce a ^b, 
contradicting a'/^b. Hence b'/^v and similarly dyt^v. Now b^d since otherwise 
{x, b, d, u} induces a claw. Thus bod from Lemma 15. 

Suppose that aoc. Because x is regular, i.e., x is adjacent to at least three 
wings, there exists e G N{x) such that e /> aoc and e />bod. Suppose that e G 

N^{x). Then C'/^b and from Lemma 15. If e u, then replace d by e, and from 
the above discussion, 6oe, a contradiction. Hence v, and we can replace a by 
e. Similarly if e ^ N^(x), i.e., e G N'^(x), then we can replace c by e. Henceforth 
we assume that a /be. 

Suppose to the contrary that a />d. From Lemma 15, a^d. Since a is bounded, 
a is adjacent to two black vertices: x and namely y. Then d'fjy, since otherwise 
d^x and d^y imply aod, a contradiction. Now since otherwise {a,d,v,y} 
induces a claw. Note that y since otherwise v ^y, contradicting the fact 
that y is black and v is white. Thus c"~ v and y induce c"~ y. However, c~a; 
and imply aoc, a contradiction. Hence aod and similarly cob. Since aod, 
dob and boc, aoc holds. However, this contradicts the assumption a f>c. □ 

We add the induced weight of an alternating cycle or a free alternating path 
to weights of appropriate vertices in it. We define zh : (P U (P x P)) ^ 3? by the 
following procedure: let w <— 0 and for each mixed vertex v, 

• if B ^{v) = {m I u is bounded, u is empty or included in a wing, 

w{u) ^ w{u) + w{v) for each u G B '"(u). 
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• otherwise there uniquely exists a black vertex x such that x = v ot X'^v, 
from Lemma 12, 

-k if X is regular, then 

— if N‘^{x) N^{x), then w{u) k- w{u) + w{v) for each u G B ^(w) n 

N'^ix), 

— otherwise w{u) k- w{u) + w{v) for each u G B '"(w) n N^{x), 

k otherwise x must be irregular, and w{t, u) <— w{t, u) + w{v) for each pair 
of vertices G B{x) such that t />u, and v or u]. 

By combining Lemmas 15 and 16, we can prove the next lemma. 

Lemma 17. Let C = (t/i, xi, ?/ 2 , • 7 2 /fc, 2 /fc+i = Hi) be an alternating 

cycle with white vertices yi, . . . ,yk and black vertices x\, . . . ,Xk (fc > 3). Then 

bx(C) = Y!1 =i w{yi) - Y!1 =i w{xi) + w{yi) + Y!1=i y*+i)- 

If there is no small augmenting cycle, by using Lemma 17, we can construct the 
Edmonds’ graph G such that 

1. each edge of G is colored black or white, and it has a weight w, 

2. all the black edges form a perfect matching M of G, 

3. if M is a maximum weight perfect matching of G then there is no large 
augmenting cycle family in G and 

4. if w{M) < w{M*) for a maximum weight perfect matching M* of G, let 
Gi , . . . , Gfc be all the augmenting cycles in M* A M ; then Gi , . . . , Gfe cor- 
respond to a maximum weight large augmenting cycle family Gi, . . . , Gfc in 

G. 

In the next section, we show that the Edmonds’ graph can be constructed in 
polynomial time. Hence the step (2) in our algorithm can be done in polynomial 
time. Analogously, if there is no augmenting cycle, for any pair of vertices a and 
b, we can find a maximum weight augmenting path whose endpoints are a and 
b, if it exists, by constructing the Edmonds’ graph and by finding a maximum 
weight perfect matching in it. Now we can find a maximum weight augmenting 
path by trying all the pairs of vertices a and b. 

5 Constructing and Modifying the Edmonds’ Graph 

We now describe how to construct the Edmonds’ graph to find a maximum 
weight large alternating cycle family. We note that Edmonds’ graphs for finding 
a maximum weight augmenting path can be obtained by modifying the construc- 
tion. 

A white alternating path P is called an irregular white alternating path 
(IWAP) if all black vertices of P are irregular and no wing contains two white 
vertices of P. For an IWAP P = (j/i, zi, j/2, ^27 • • • 7 Zk-i,yk), we define its weight, 
denoted by Sx{P) as 

Sx{P) = Eti + Eti y.+i)- 
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Then Lemma 17 can be described in terms of I WAP: 

Lemma 18. Let C = {P\,xi,P 2 ,X 2 , ■ ■ ■ ,Pk,Xk,Pk+i = P\) be ein alternating 
cycle of length at least 6 such that k >2, x\, . . . ,Xk are distinct regular vertices 
and Pi, . . . ,Pk are IWAPs. Then 

bx{C) = bx{Pi) - Et=i 

Lemma 19. Let A and B be subsets of bounded vertices. Then a maximum 
weight LWAP whose endpoints are in A and B respectively can be found in poly- 
nomial time. 

Proof. We can reduce this problem to find maximum weight directed paths in 
directed acyclic graphs. □ 

Now we make the Edmonds’ graph Gsd. Let x\,...,Xr be all the regular 
vertices. GEd has 2r vertices, namely xl,xf (i = l,...,r). Join x] and xf by 
a black edge with weight iu(xj,xf) = w(xi) (i = l,...,r). For each pair of 
regular vertices Xi and xj and for p,q G {1,2}, if there exists an IWAP whose 
endpoints are in N’P{xi) and N^{xj), join and x^ by a white edge whose 
weight w(x^,Xj) is the maximum weight among such IWAPs. Now we finish 
constructing the Edmonds’ graph. 

Let M be the set of all the black edges. Note that M is a perfect matching. 
An alternating cycle C of length 2fc > 6 in GEd corresponds to a large alternating 
cycle C in G where C has k regular vertices and 6m{G) = 6x{C). 

But this is not true for fc = 2. Let C = {xl,Xj,x‘j,x^,xD be an alternating 
cycle of GEd- Here Xi and Xj are distinct regular vertices. We denote Ppq as 
the maximum weight IWAP corresponding to the edge (xf , x'l) in the Edmonds’ 
graph for p,q G (1, 2}. If 5m{C) is not positive, then there is no problem in our 
purpose. So suppose that its weight is positive, i.e. 6m{G) = Sx{Pii) + Sx{P 22 ) — 
w{xi) — w{xj) > 0 . 

If P\i and P 22 have no vertex in common, then C = (xi, Pu,Xj, P 22 ,Xi) is 
a small augmenting cycle, contradicting to that we have already eliminated all 
the small augmenting cycles. Hence we can denote 

Pii = {y\,zi,y\,Z2,...,y\_i,zt-i,yl) and 
P22 = {yl,zi,yl,Z2,...,yj_i,ze-i,yi). 

Here z\, . . . , Zi-i are irregular vertices, both and y^ are in a common wing 
WkioY k y\ G N'^{xi), yj G N^{xi), y\ G N'^{xj) and y} G N‘^(xj). 

We first discuss an easy situation. A wing W is said to be irregular reachable 
to a regular vertex x if there exist an integer m > 1, distinct irregular vertices 
zi, . . . , Zm-i and distinct wings W\{= W), IF 2 , . . . , Wm such that Wi is adjacent 
to zi and Wk is adjacent to Zk-i and Zu for k = 2,...,m, where Zm = x. Let 
W{xi,Xj) denote the union of all the wings that are irregular reachable to both 
Xi and Xj. 
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Lemma 20. IfN^ {xj) C W{xi, Xj), then any large alternating cycle in G passes 
through neither P\2 nor ^22- That is, we can delete the edges (xjjx'j) and {xf, x'j) 
from Ge<i- Similarly ifN'^fxj) C W{xi,Xj), we can delete (xj,Xj) and {xf,x^). If 
C W{xi,Xj), we can delete {xf,Xj) and (xf,Xj). If N'^{xi) C W{xi,Xj), 
we can delete {x\,x^^) and (xj,x^). 

Proof. Suppose that a large alternating cycle G passes Xi, P12 (or P22) and xj. 
Before Xj, it passes a vertex in N‘^{xj). Hence after xj, it must pass a vertex 
V G N^{xj) C W{xi,Xj). Hence G contains exactly two regular vertices Xi and 
Xj, contradicting to that G is large. □ 

In the sequel, we suppose that none of N^{xi), N‘^{xi), N^{xj) nor N‘^{xj) 
is contained in W{xi,Xj). 

Lemma 21. There exists k such that yl = and 2 < k < £— 1 , or there exists 
k such that yl^yl and 1 < k < £. 

Proof. Suppose that this lemma does not hold, i.e. yl yf y 1 and y\']Iy\ for all 
k = !,...,£ (Note that y\ yf y"l and y\ yf y"j since B^fxi) n B'^{xi) = B^fxj) fl 
B'^{xj) = 0). Let zo = Xi, zi = Xj and Gk = {yl, Zk,yl, Zk-i,yl) {k = !,...,£). 
Then Gk is a small alternating cycle for all k = !,...,£. We can show that 
J 2 k=i Sx{Gk) = Sx{Pii) + Sx{P22) - w{xi) - w{xj){> 0). (The proof is slightly 
complicated because we must consider about the induced weight w.) Hence at 
least one Gk is a small augmenting cycle, a contradiction. □ 

Now we can show the next two lemmas, but proofs are omitted. 

Lemma 22. If£ = 1, any large alternating cycle passes through neither Pu nor 
P22. Hence we can delete the edges {x\,xl) and {x'f,x'j) from Ge<i- 

Lemma 23. If £ > 2 , the followings hold. 

1 . There exists k such that 2 < k < £ — 1 and yl = yl, or there exists k such 

that l<k<£-l,yl^yl, yl^^ yf yl'/^yl+^ and yl'/^yl+^. 

2 . For such k, let 

Pm = {yl,zi,yl,...,zk-i,yl), Piij = {yl+i, Zk+i, ■ . ■ ,y}_i, ze-i,yl), 
P22z = {yl,zi,yl,...,Zk-i,yl) and P22j = (y^+i, Zfc+i, . . . , y|_i, z<?_i, y|), 

and let P{2 = {Pm, Zk, P22j) and P21 = {P22i, Zk, Piij). 

Then Sx{Pi2) + ^^(^2^) = ^x(^’ii) + dx{P22), P\2 an IWAP between 
B^{xi) and B‘^{xj), and P21 is an IWAP between B‘^{xi) and B^{xj). 

3 - Sx{Pll) + Sx{P22) = 5 x[Pi 2 ) + 5 x{P 2 i)- 

4 - 3 x{P{ 2 ) = ^x{Pi2) and Sx{P2i) = 5 x{P 2 i)- 

Summing up the above discussion, dealing with three cases, i.e. Lemmas 20, 
22 and 23, we modify the Edmonds’ graph. In the first two cases, elimination 
of augmenting cycles of a form {x\ , x}j , xj , xf , x}) or {xj , xj , xj , xf , x}) can be 
easily done by deleting edges. In the last case, we modify Gsd as below: 
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1. Delete four edges (x^,x^), (xl,Xj) and (xf,xj) (Lemma 23 guaran- 

tees the existence of these four edges), 

2. Add two new vertices zl and z^, join zl and z^ by a black edge and assign 
its weight io((zl, zl)) to be 0, where k satisfies the conditions of Lemma 23, 

3. Add four white edges (xl,zl), (xf, zl), (xj,zl) and (xpzl), and assign their 

weightstobe = h(Pii), w((xf,zl)) = 6x{,P22), w{{x],zD) = 0 

and w{{x'j,zl)) = 6x{Pi2) ~ Sx{Pii){= 5x{P22) ~ 5x{P2i))- 

All large alternating cycles through black edges {x},xl) and (xj,x^) can 
be preserved by our revision, because (x^jX^) in the original Edmonds’ graph 
(p, q G {I, 2}) is interpreted by the path (xf, zl, zl, x^) in the revised Edmonds’ 
graph. Furthermore, Lemma 23 guarantees that weights of these four edges are 
equal to those of such four paths, respectively. 

Lemma 24. A maximum weight large alternating cycle family can be found in 
polynomial time if there is no small augmenting cycle. 

Proof. Make the Edmonds’ graph. Then eliminate all the augmenting cycles of 
a form {xl,Xj,x‘j,xl,xl) or {xl,x‘j,Xj,xl,x}). Let be the modified graph 
and M' be the set of its black edges. Note that M' is perfect. Let M* be a 
maximum weight perfect matching and Gi , . . . , Gfe be all the augmenting cycle in 
M' A M* (k may be zero). Note that (IJ^i ^'i) is a maximum weight alternating 
cycle family of G^^. Then each Ci has length at least 6 because we eliminate all 
augmenting cycles of length 4, and hence Ci corresponds to a large augmenting 
cycle Ci of X such that 5x{Ci) = SM'{Ci). Moreover Gi,...,Gfc are disjoint 
because C\, . . . ,Ck are vertex-disjoint. Now from construction and modification 
of the Edmonds’ graph, we can conclude that (IJ^i P'i) i® ^ maximum weight 
large alternating cycle family of A. □ 
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Abstract. A simple proof is presented for the min-max theorem of 
Lovasz on cacti. Instead of using the result of Lovasz on matroid parity, 
we shall apply twice the (conceptionally simpler) matroid intersection 
theorem. 



1 Introduction 

The graph matching problem and the matroid intersection problem are two well- 
solved problems in Combinatorial Theory in the sense of min-max theorems and 
polynomial algorithms for finding an optimal solution. The matroid parity prob- 
lem, a common generalization of them, turned out to be much more difficult. For 
the general problem there does not exist polynomial algorithm [2] , [3] . Moreover, 
it contains NP-hard problems. On the other hand, for linear matroids Lovasz 
[3] provided a min-max formula and a polynomial algorithm. There are several 
earlier results which can be derived from Lovasz’ theorem, e.g. Tutte’s result 
on /-factors [9], a result of Mader on openly disjoint A-paths [5], a result of 
Nebesky concerning maximum genus of graphs [6]. Another application which 
can be found in the book of Lovasz and Plummer [4] is the problem of cacti. It 
is mentioned there that ”a direct proof would be desirable.” Our aim is to fill 
in this gap, that is to provide a simpler proof for this problem. We remark here 
that we shall apply the matroid intersection theorem twice. We refer the reader 
to [7] for basic concepts of matroids. 

A graph K is called cactus if each block (maximal 2-connected subgraph) of 
AT is a triangle (cycle of length three). The size of a cactus K is the number 
of its blocks. Lovasz derived a min-max theorem for the maximum size of a 
cactus contained in a given graph G from his general min-max theorem on linear 
matroid parity problem. Here we shall give a simple proof for this result on cacti. 
The proof follows the line of Gallai’s (independently Anderson’s [1]) proof for 
Tutte’s theorem on the existence of perfect matchings. 

In fact, we shall solve the graphic matroid parity problem in the special case 
when for each pair the two edges have exactly one vertex in common. The graphic 
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Grenoble. 

R. E. Bixby, E. A. Boyd, and R. Z. Rios-Mercado (Eds.): IPCO VI 
LNCS 1412, pp. 84-95, 1998. © Springer-Verlag Berlin Heidelberg 1998 




On a Min-max Theorem of Cacti 



85 



matroid parity problem is the following. Given a graph G and a partition of its 
edge set into pairs, what is the maximum size of a forest which consists of pairs, 
in other words, what is the maximum number of pairs whose union is a forest. 
A pair of edges is called v-pair if these two edges have exactly one vertex in 
common and they are not loops. If G is an arbitrary graph and V is a partition 
of the edge set of G into v-pairs then (G, V) is called v-graph. From now on a 
cactus of (G, V) is a forest of G consisting of v-pairs in V. The size of a cactus 
is the number of v-pairs contained in it. The v-graphic matroid parity problem 
consists of finding the maximum size f3{G, V) of a cactus in a v-graph (G, V). 

The original cactus problem can be formulated as a v-graphic matroid parity 
problem as follows. Let (G',V) be the following v-graph. The vertex set of G' 
is the same as of G. We define the edge set of G' and the partition V of the 
edge set into v-pairs as follows. For each triangle T of G we introduce a v-pair 
in (G', V): choose any two edges of T, add them to the edge set of G' and add 
this v-pair to V. (G' will contain lots of parallel edges. In fact, G' is obtained 
from G by multiplying edges.) Obviously, there is a one to one correspondence 
between the cacti of G and the forests of G' being the union of v-pairs. Thus 
the problem is indeed a v-graphic matroid parity problem. 

To state the theorem on cacti we need some definitions. Let (G, V) be a v- 
graph. Let V := { Vi, V 2 , Vi} be a partition of the vertex set V (G). Let V-p C V 
{S-p C V) be the set of those v-pairs whose end vertices belong to three (two) 
different members of V. Let Q := {Hi, H 2 , ■■■, Hk} be a partition of Vp U Sp. 
Let us denote by p{Hi) the number of V^’s for which there exists at least one 
v-pair in Hi with a vertex in Vj. We say that {V, Q) is a cover of (G, V). The 
value val{V, Q) of a cover is defined as follows. 

val{V,Q) :=n-l+Y^ 

HiGQ 

where n = |G(G)|, I = \V\ and k = |Q|. 

Now, we are able to present the min-max result of Lovasz [4] on cacti in our 
terminology. 

Theorem 1. Let (G,V) he a v-graph. Then (3{G,V) = mm{val{V, Q)}, where 
the minimum is taken over all covers {V, Q) o/(G, V). 

Remark 1. In the definition of a cover, Q could be defined as the partition of V 
and not of VpU5p. Indeed, if {V, Q) is a cover, then the pairs in V — VpU5p can 
be added to Q as new members of the partition without changing the value of the 
cover since p{T) = 1 for all T G V— VpU5p. We mention that if T G V — VpU5p, 
then T can not be in a maximal cactus. 

A cactus of (G, V) is called perfect if it is a spanning tree of G. If G contains 
only one vertex v and no edge, then the vertex v is considered as a perfect cactus 
of (G, V). Since a spanning tree contains n — 1 edges, (3{G, V) < ; for any 

v-graph (G, V). 
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Let {V, Q) be a cover of a v-graph (G, V). The elements Hi G Q are called 
components of the cover. G /V will denote the graph obtained from G by con- 
tracting each set Vi in "P into one vertex. We identify the edge sets of G and 
G/V. (G-p,V-p) is the v-graph, where Vv is defined as above, it contains those 
v-pairs of V which remain v-pairs after the contraction, the vertex set of G-p is 
the same as of GjV and the edge set of Gp is the set of edges of the v-pairs in 
Vp, that is, Gp is obtained from G/P by deleting the edges which do not belong 
to any v-pair in Vp. For Hi G Q, (G-p [Hi], Hi) will denote the v-graph for which 
the edge set of Gp [Hi] is the set of edges of the v-pairs in Hi and the vertex set 
of Gp[Hi] contains those vertices of Gp for which at least one v-pair of Hi is 
incident. Then p(Hi) is the number of vertices of Gp[Hi], (Note that if Hi € Sp, 
then (Gp[Hi], Hi) contains two edges which are parallel or one of them is a loop, 
that is it is not really a v-graph. However, we shall need this type of ”v-graphs” 
in the proof.) If P is a subset of edges of a v-graph (G, V) then the number of 
v-pairs of V contained in F is denoted by vv(F). 

For a graph G on n vertices and with c connected components, a forest of G 
containing n — c edges is called spanning. For a connected graph G, a forest F of 
G containing n — 2 edges (that is, F has exactly two connected components) is 
called almost spanning. A v-graph will be called (cactus) -critical if by identifying 
any two vertices the v-graph obtained has a perfect cactus. Especially, this means 
that in a critical v-graph there exists a cactus which is almost perfect, that is, 
it is an almost spanning tree consisting of v-pairs. Critical v-graphs will play an 
important role in the proof, like factor-critical graphs play the key role in the 
proof of Tutte’s theorem. A component Hi G Q is said to be critical in (G, V) if 
the v-graph (Gp[Hi], Hi) is critical. If Hi G Sp, then (Gp[Hi], Hi) is considered 
to be critical. 

We say that the partition 7^ of V is the trivial partition if I := [V] = n := [V[. 
The cover (V,Q) is the trivial cover if Z = n and fc := |Q| = 1. Let V = 
{V-f,...,V{^,Vf,..., Vp , ...,Vi^, ...,Vp}, where UjVf = Vi for all i, then the 
partition V' is called a refinement of the partition 7^. If 7^' is a refinement of V 
so that [V'] = |7^| -I- 1, then we say it is an elementary refinement. IfViGV then 
the partition obtained from V by replacing Vi by its singletons will be denoted 
by 7^-^ {Vi}. If V is a refinement of V, then we shall use p'(Hi) instead of p(Hi). 

We shall need later two auxiliary graphs B and D. These graphs will depend 
on a v-graph (G, V) and a cover (V, Q) of this v-graph. We suppose that for 
each component Hi, p(Hi) is even. First we define the graph B = (V (G), E(B)). 
e = uv will be an edge of B if and only if there exist u,vGVjGV, Hi gQ and 
a cactus K in (Gp^{Vj}[Hi], Hf) consisting of p(Hi)/2 v-pairs so that exactly 
two vertices u and v of Vj are connected in K, not necessarily by an edge but by 
a path in K, that is u and v are in the same connected component of K. (Note 
that K contains a cactus of size (p(Hi) — 2) /2 in (Gp[Hi], Hi). We mention that 
(by Lemma 2, see later) (Gp[Hi], Hi) will always contain a cactus consisting of 
(p(Hi) — 2)/2 v-pairs of V.) We call this edge e an augmenting edge for Hi. In 
other words, the trace of the cactus 7F in 7^ is the edge e. We will call the edges 
of B as augmenting edges. Note that an edge of B may be augmenting for more 
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Hi G Q. Let V' be a refinement of V. Then the set A-pi of augmenting edges 
connecting vertices in different sets of V' will be called the augmenting edges 
with respect to the refinement V' . 

The second auxiliary graph D will be a bipartite graph with colour classes 
E{B) (the edge set of B) and Q. Two vertices e G E{B) and Hi € Q are 
connected if and only if e is an augmenting edge for Hi. As usually, the set of 
neighbours of a vertex set X of one of the colour classes of D will be denoted by 
Ed{X). 

Finally, some words about the ideas of the proof. As it was mentioned earlier 
we shall follow the proof of Tutte’s theorem. Let us briefly summarize the steps 
of this proof. We suppose that the Tutte condition is satisfied for a given graph 
G and we have to construct a perfect matching of G. Let A be a maximal 
set satisfying the condition with equality. The maximality of X implies that 
all the components of G — A are factor-critical, thus it is enought to find a 
perfect matching in an auxiliary bipartite graph D, where one of the color classes 
corresponds to A while the other to the (critical) components. Hall’s theorem 
(or the matroid intersection theorem) provides easily the existence of a perfect 
matching M in D. The desired perfect matching of G can be obtained from M 
and from the almost perfect matchings of the critical components. We mention 
that this is a lucky case because the union of these almost perfect matchings will 
be automatically a matching in G. 

In the case of cacti it is not easier to prove that version where we have to find 
a perfect cactus, so we shall prove directly the min-max theorem. We shall choose 
a minimal cover {V, Q) of (G, V) which is maximal in some certain sense. This 
will imply that the minimal cover of {G-p[Hi\, Hi) is unique for each component 
Hi. This fact has two consequences, namely (i) each component Hi is critical 
(hence p{Hi) is even) and {ii) for any component Hi and for any refinement V' 
of V, either there exists an augmenting edge for Hi with respect to V' or its 
cover rests minimal in (Gp/ iLj). 

We shall construct the cactus of size val{V, Q) in (G, V) as follows. (1) For 
n — I components Hi, we shall find a cactus Ki in {G-p^^Vj}[Hi\, Hi) of size 
p{Hi)/2 so that the trace of Ki in V is an edge and the corresponding augmenting 
edges form a spanning forest of the auxiliary graph B. (We shall see that the size 
of a spanning forest of B is indeed n—l.) (2) For the other components Hj we shall 
need a cactus in {G-plnj], Hj) of size p{Hj)/2—l, and (3) the union of all of these 
forests will be a forest, that is a cactus of size val{V, Q). Using (i), for the latter 
components Hj it is enough to find an arbitrary almost spanning tree in G-p [Hj] 
(and then using that Hj is critical, this forest can be replaced by a convenient 
cactus containing the same number of edges, that is of size p{Hi)/2 — 1). By 
the definition of augmenting edge, for the former components Hi it is enough to 
consider an arbitrary spanning tree in Gp[Hi] so that (*) there exist augmenting 
edges for these components whose union will be a spanning forest of B. Thus we 
have to find a forest F in G so that (a) E{F) n E{Gp[Hj]) is either a spanning 
tree or an almost spanning tree in Gp[Hj], (b) for n — I components Hi we have 
spanning tree, (c) for these components (*) is satisfied. 
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The existence of a forest with (a) and (b) can be proved, using (ii), by a 
matroid partition theorem (for a graphic matroid and a truncated partitional 
matroid). We shall see in Lemma 5 that if for all such forests we consider the 
components where the corresponding forest is a spanning tree then we get the 
set of basis of a matroid on the set of indices of the components. 

Two matroids will be defined on the edge set of the auxiliary graph D, one 
of them will be defined by the above introduced matroid, and the other one will 
be defined by the cycle matroid of B. The matroid intersection theorem will 
provide a forest of G with (a), (6) and (c). As we mentioned earlier, each part of 
the forest, which corresponds to a component, can be replaced by a convenient 
cactus, and thus the desired cactus has been found. 

2 The Proof 

Proof, (max < min) Let F be an arbitrary cactus in (G, V) and let {V , Q) be any 
cover of (G, V). Contract each Vi & V i = 1,2, . . . ,l into a vertex and let F' be a 
subset of F of maximum size so that F' is a forest in the contracted graph G/V . 
For the number c (c') of connected components of F’ in G (of F' in G/V) we have 
obviously, c' < c. Thus \F\ = n — c < n — c' = {I — c') + {n — 1) = \F' \ + {n — 1) . It 
follows that vv{F) < vv{F') + n — 1. Let F" be the maximum subforest of F' in 
G-p consisting of v-pairs in V. Obviously, F” forms a cactus in each (Gp[iFi], 

Hi G Q. By definition, IJ// covers all the v-pairs contained in F” . Thus 

W{F') = vv{F'') = wAF") = Eh^^qVhAF") < and the 

desired inequality follows. □ 

Proof, (max > min) We prove the theorem by induction on the number n of 
vertices of G. For n = 3 the result is trivially true. 

Let {V, Q) be a minimum cover of (G, V) for which I is as small as possible 
and subject to this k is as large as possible. Note that by the maximality of k, 
each pair in S-p will form a component because for each Hi G Sp, ~ 

Lemma 1. For each Hi G Q, the unique minimum cover of {Gp[Hi\, Hi) is the 
trivial one. 

Proof. Let {V , Q') be a minimum cover of (Gp[Hi],Hi). Clearly, valiV' , Q') < 
|^ p(gi)-i j ^ Using this cover, a new cover {V*,Q*) of (G, V) can be defined as 
follows. Let the partition V* of V (G) be obtained from V by taking the union 
of all those W and Vs whose corresponding vertices in Gp are in the same set 
of V . Then I* = I — p{Hi) + I', where I' = \V'\. Let Q* be obtained from Q by 
deleting Hi and by adding Q' . We claim that the new cover is also a minimum 
cover. 



-i*+ Y. 



P{Hj) - 1 
2 



+ E 

H'.gQ' 



p'jH') - 1 
2 



val{V*,Q*) <n 
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= n-l*+ V 

2 

+ 0!) - {pm - 1')) 

<n-{l-p{H,) + l')+ 

p{Hi) - 1 



p{H,) - 1 



- p{Hi) + 1' = val{V, Q). 



It follows that equality holds everywhere, so val{V', Q') = J , thus the 

trivial cover of {G-p[Hi\, Hi) is minimal. Furthermore, by the minimality of I, V' 
is the trivial partition of V {G-p [Hi]) and by the maximality of k, Q! may contain 
only one set and we are done. □ 



Lemma 2. Each component Hi & Q is critical. 

Proof. Suppose that there exists a component Hi G Q for which {Gp[Hi\, Hi) 
is not critical, that is there are two vertices a and b in Gp[Hi] so that after 
identifying a and b the new v-graph (G',V') has no perfect cactus. By the hy- 
pothesis of the induction, it follows that there is a cover (P', Q') of (G',V') so 
that in G' val{Vf Q') < . This cover can be considered 

as a cover {V", Q") of {Gp[Hi], Hi) and val{'P” , Q") = val{V , Q') + 1. Thus 
val{V",Q") < \, that is {V",Q") is a minimal cover of {Gp[Hi],Hi) 

but not the trivial one (a and b are in the same member of V"), which contra- 
dicts Lemma 1. □ 



Corollary 1. If Hi G Q and a,b are two vertices of Gp[Hi], then there exists 
an almost perfect cactus K in {Gp[Hi],Hi) so that a and b belong to different 
connected components of K. 

Proof. By Lemma 2, Hi is critical, so by identifying a and b in Gp[Hi], the 
v-graph obtained has a perfect cactus K. Clearly, K has the desired properties 
m{Gvm,H,). □ 



Remark 2. By Corollary 1, for any component Hi the v-graph {Gp[Hi\, Hi) (and 
consequently (G, V)) contains a cactus containing j y-pairs. However, at 

this moment we can not see whether we can choose a cactus containing 
v-pairs for all Hi so that their union is a cactus as well. Note that by Corollary 
1, p{Hi) is even for each component Hi G Q, that is J = 



Proposition 1. If I = n — l,k = 1 and val{V, Q) = 1 + {G, V) has 

a perfect cactus. 
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Proof. Let (uv,vw) be one of the v-pairs in V. Let us consider the following 
cover {V', Q') of (G, V). Each set of V contains exactly one vertex of G except 
one which contains u and v, and Q' contains exactly one set H (containing all 
v-pairs in V). Then, clearly, this is a minimum cover. By the assumptions for I 
and k this cover also minimizes I and maximizes k, thus by Lemma 2, its unique 
component H is critical, that is the v-graph {G-p',V-p') is critical. Let E be a 
perfect cactus of the v-graph obtained from {G-p' ,V-p') by identifying v and w. 
Obviously, F UuvUvw is a, perfect cactus of (G, V) and we are done. □ 



Lemma 3. Let V' he a refinement ofV and let Hi G Q for which Hi ^ ru{Ap:). 
Then the trivial cover is a minimal cover of {Gpi[Hi],Hi) with value j ^ 

Proof. By Corollary 1, (Gp[Hi],Hi) (and consequently {Gp'[Hi\, Hi)) contains 
a cactus of size j ^ hence the value of a minimum cover of {Gpi[Hi], Hi) 

is at least j ^ Thus what we have to show is that the trivial cover has 

value \ . 

First, we prove this when V' is an arbitrary elementary refinement of V, say 
= Vj. We shall denote the vertices of Gp'[Hi] corresponding to and 
by v\ and V 2 . In this case we have to prove the following. 

Proposition 2. v\ or V 2 does not belong to Gp'[Hi\. 

Proof. Hi P]j{Api) implies that there exists no augmenting edge for Hi with 
respect to V that is (G-p/ [Hi], Hi) has no perfect cactus. By Proposition 1, we can 
use the induction hypothesis (of Theorem 1), that is there exists a cover {V" , Q") 
of {Gpi[Hi\, Hi) so that valifP" , Q") < _ 1 = This cover 

gives a cover {V* , Q*) of {Gv[H,],H,) with val{V*, Q*) < So {V*, Q*) 

is a minimum cover of {Gp[Hi], Hi) and by Lemma 1, it is the trivial cover. 
Moreover, vi and V 2 are in different sets of V" (otherwise, val{V* , Q*) < , 

a contradiction), hence {V* , Q*) is the trivial cover of (Gp/ [Hi], Hi) and its value 
is . It follows that vi or V 2 is not a vertex in Gp'[Hi] and the proposition 

is proved. □ 

Let -P' = {Vf,...,Vf\Vf,...,Vf^,...,Vi^,...,Vp} where = V, for all i, 
be a refinement of P. It is enough to prove that for all i where > 2 there exists 
an elementary refinement P* of P with Vi = Vf U {Vi — Vf) for some 1 < j < 
so that in Gp* [Hi] the vertex corresponding to T) — Vf is isolated. Applying 
Proposition 2, at most times, we see that such an elementary refinement 
exists indeed. □ 



Corollary 2. The vertex sets of the connected components of the graph B (de- 
fined by the augmenting edges) are exactly the sets in P. 
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Proof. By definition, there is no edge of B between two different sets of V. Let 
us consider an elementary refinement V' of V. If there was no augmenting edge 
with respect to this refinement, then by Lemma 3, the value of the cover {V' , Q') 
would be val{V , Q) — 1, where Q' is obtained from Q by adding the elements of 
S-p' — Sp as new members of the partition, contradicting the minimality of the 
cover (7^, Q). This implies that the subgraphs of B spanned on the sets Vi in V 
are connected. □ 

Let Fi be an arbitrary spanning tree of Gp\Fli] for all Hi G Q (by Corollary 1, 
Gp[Hi] is connected). Then E{Fi) n E{Fj) = 0 if f j because the components 
of Q are disjoint. Let W = (V(Gp), E(W)) where E(W) := if(Fi). 

Let V' be a refinement of V with \V'\ = V . Let Q\ := Fd{Ap^) and Q 2 := 
Q — Qi- We define two matroids on E{W). Let Q be the cycle matroid of W 
with rank function rg, that is the edge sets of the forests are the independent 
sets. Let Fp' := F 1 +F 2 (direct sum), where Fj will be the following (truncated) 
partitional matroid (with rank function Vj) on Ej := ,E{F,) J = 1,2. 

Let Fi contain those sets E C Ei for which \F n E{Ei)\ < 1 for all i and the 
intersection can be 1 at most to := |Qi| — {I' — 1) times. Let F 2 contain those 
sets E C E 2 for which \F n E{Fi)\ < 1 for all i. For the rank function r' of Fpi 
F{X) = n{xn El) + r 2 {x n E 2 ). 

Lemma 4. For any refinement V' ofV, E{W) can he written as the union of 
an independent set in Q and an independent set in Fpi. 

Proof. This is a matroid partition problem. It is well-known (see for example [7]) 
that the lemma is true if and only if for any Y C EfW), |T| < rg{Y) + r'{Y). 
Suppose that this is not true, and let T be a maximum cardinality set violating 
the above inequality. Then, clearly, Y is closed in Fpi . Thus Y can be written in 
the form Y = UffiSg* E{Fi), for some Q* C Q. Let Ki, . . . , Kche the connected 
components of the graph K* on the vertex set V{Gp) with edge set Y. Then 
rg{Y) = l-c = EWKj) - 1). Let t := \Q* n Qi|. 

Let Q” be obtained from Q by taking the unions of all those Hm and Hm' 
in Q* for which F^ and Fm' are in the same connected component of K* , that 
is each member H” G Q" — Q corresponds to some Kp so p{H") = p{Kj). 

Case 1. t < to. Then r'{Y) = |Q*|. Let us consider the cover {V, Q") of (G, V). 
Since 0 < val(V, Q") - val(V, Q) = Ei 

|F|= ^ (p(H,)-l)=2 + 

HiGQ* Hi&Q- 

" p{H'f) - 1 

< + m=rg{Y)+ r'{Y) 

1 

contradicting the assumption for Y. 

Case 2. t > to. Now, by the closedness of Y in Fp', Y contains all the trees Fi 
for which Hi G Qi. Thus r'(Y) = n(Y nEi) + r 2 {Y HE 2 ) = to + (|S*|- |Qi|) = 
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|Q*| — (r — 1). Let us consider the following cover ("P', of (G, V), where 
is obtained from the above defined Q" by adding each element in S-pi — S-p as 
a component and adding the v-pairs in Vv — Vp to appropriate members of 
Q” . If L G Vpi — Vp, then L corresponds in VL to a vertex or an edge and in 
the latter case L G Qi so L corresponds to an edge of a connected component 
Kj of K* . We add L to the member H” of Q” corresponding to Kj. (If Kj is 
an isolated vertex, then the corresponding H” of Q” was empty earlier.) Now, 
{V , is a cover of (G, V) indeed. We shall denote the members of — Q by 

Hj 1 < j < c. Clearly, < I' . By Lemma 3, the value of the new cover 

is the following. 



^ q3) = n _ i' + J2 

<n — I' + 



p'{Hf) - 1 



1 

r -c 



E 

HieQ-Q- 



+ E 

HieQ-Q- 
p{Hi) - 2 



p{H,) - 2 



Using that val{V, Q) < val{V , Q^) we have the following inequality. 
|U|= ^ {p{H,)-l)=2 ^ p!Mz1 + \q*\ 

HiGQ* HiGQ- 

<{l-c) + {\Q*\-{l' -l)) = rg{Y) + r'{Y), 



contradicting the assumption for Y. The proof of Lemma 4 is complete. □ 

By Lemma 4, for the trivial partition V' of U(G), the following fact is im- 
mediate. 



Corollary 3. There exists a forest F in the graph W so that for n — I indices i 
E{Fi) C E{F) and E{F) n E{Fi) is an almost spanning tree of Gp[F[i] for the 
other indices. □ 



We shall need the following claim whose proof is trivial. 

Proposition 3. Let E be a forest on a vertex set S. Let F' be a subgraph of 
F with two connected components F{ and F^. If F{ and F 2 belong to the same 
connected component of F then let us denote by a and b the two end vertices 
of the shortest path in F connecting F[ and F 2 , otherwise let a G U(F() and 
b G V{F 2 ) be two arbitrary vertices. Let F" be any forest on V(F') with two 
connected components so that a and b are in different connected components of 
F". Then {F — E{F')) U E{F") is a forest on S. □ 



Remark 3. By Corollary 3, there exists a forest F inW and consequently in Gp 
so that E{F) n E{Fi) is a forest with two connected components on V{Gp[Hi\) 
for all components Hi. Let Hi be an arbitrary component of Q. By Corollary 
1, for the two vertices a and b defined in Proposition 3 {F' = E{F) n E{Fi)), 




On a Min-max Theorem of Cacti 



93 



there exists an almost perfect cactus K in {G-plIIi], Hi) so that a and b belong to 
different components of K. Then, by Proposition 3, F — (E{F) n E{Fi)) U E{K) 
is a forest. We can do this for all components, so the v-graph (G, V) contains a 
cactus containing v-pairs. 

Now we define a matroid {Q,M) on the sets of Q. Let Q' C Q be in At if 
and only if there is /j G E{Ei) for each Hi € Q — Q' so that E(W) — Ufi is a 
forest in W. 

Lemma 5. (Q,M) is a matroid. 

Proof. We show that Ai satisfies the three properties of independent sets of 
matroids. 

(1) By Lemma 4, for V' = V , % & Ai. 

(2) If Q" C Q' G At, then Q" G Ai because any subgraph of a forest is a forest. 

(3) Let Q', Q” G Af so that \Q”\ < \Q'\. By definition, there are /' G E{Ei) 

for H,gQ-Q' and /" G E{Ei) for Hi € Q - Q" so that T' := E{W) - U/' 
and T" := E{W) — U/" are forests in W. Choose these two forests T' and T” 
so that they have edges in common as many as possible. \Q"\ < \Q'\ implies 
that T' has more edges than T” has. T' and T” are two independent sets in the 
matroid Q thus there is an edge e G T' — T" so that T" U e is also a forest in 
W. Then, clearly, e = /" for some i. If e € E(Fi) with Hi ^ Q' then replace /" 
by // and the new forest T* with T' contradicts the assumption on T' and T". 
Thus e G A(Fj) so that Hi G Q' and then obviously Q" {Hi} G Ai and we are 
done. □ 

We shall apply the matroid intersection theorem for the following two ma- 
troids on the edge set of the graph D. For a set Z C E{D), let us denote the 
end vertices of Z in the colour class E{B) (Q) by Zi {Z 2 , respectively). The 
rank of Z in the first matroid will be rs(Zi) and r^(Z 2 ) in the second matroid, 
where rs is the rank function of the cycle matroid of the graph B and is the 
rank function of the above defined matroid A4. Note that if a vertex a: of G is 
in the colour class E{B) (Q) then the edges incident to x correspond to parallel 
elements of the first (second) matroid. 

Remark 4 .. By Corollary 2, rB{E{B)) = n — l and by Corollary 3, r_\ 4 {Q) > n — l. 
Moreover, if A-p’ is the set of augmenting edges of some refinement V' of V, then 
by Lemma 4, 



I' — I < TMiFoiApi)). 



( 1 ) 



Lemma 6. There exists a common independent set of size n — I of these two 
matroids. 

Proof. By the matroid intersection theorem (see for example [7]) we have to 
prove that for any set Z C E{D) (-I-) n — l< tb^E^D) — Z) + tm(Z). 
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Suppose that there is a set Z violating (+). Clearly, we may assume that 
E{jy) — Z is closed in the first matroid. This implies that there is a set J C E{B) 
so that E{D) — Z is the set of all edges of D incident to J and J is closed in 
the cycle matroid of B. Let us denote by C/, C 2 , the vertex sets of the 
connected components of the graph on vertex set V{B) with edge set J. Then 
by the closedness of J, E{B) — J is the set of augmenting edges of the refinement 
V' := {¥{,¥ 2 , ■■■,¥(,} of V, that is, A-p/ = E{B) — J. (Obviously, Z is the set 
of all edges incident to E{B) — J in D.) Then tm{Z) = rM{rD{Av)) and 
tb{E{D) — Z) = rB{J) = n — I'. By (1), V — I < r^(r£i(A-p/)) and thus 
n — I = {V — 1) + {n — V) < rj\^(Z) + rB{E{D) — Z), contradicting the fact that 
Z violates (+). □ 

The Construction of the Desired Cactus. Let N C E{D) be a common 
independent set of size n — 1. (By Lemma 6, such a set exists.) It follows that 
is a matching in D so that it covers a basis E' in the cycle matroid of B and 
an independent set Q' in A4. Thus there exists a forest F' on ¥{Gp) so that it 
spans the spanning trees Fi in Gp\Hi] for Eli G Q' and almost spanning trees 
Fi — fi in Gp[F[i] (for appropriate fi) for Eli G Q — Q' . Let us denote by c the 
number of connected components of F' . Clearly, E' U E{F') is a forest on ¥ (G) 
containing 2(n — I + Y1,h &Q edges and it has c connected components. 

{\E'UE{F')\ = \F'\ + \E(F')\=n-l + j:H,^Q'iPm-l) + EH,eQ-Q'iPm- 
2) = 2(n — 1) + ~ 2)-) 

We shall change the trees and forests by appropriate ones obtaining a cactus 
of the desired size. As in Remark 3, for each Eli G Q — Q' we niay replace in F' 
Fi — fi by an almost perfect cactus in Gp[Eli] obtaining a forest F” on ¥{Gp) 
with the same number of edges. As above, E' U E{F") is a forest on ¥{G). For 
all e G if' e is an augmenting edge for He G Q', where He is the pair of e 
in the matching N. Thus there exists a cactus Ke in {Gp^Vi\He]i He) of size 
p{Hi)/2 so that the trace of Ke in ¥i is the edge e, where ¥i G V contains the 
edge e. (Note that each Ke corresponds to a connected graph Fg in Gp[He].) 
Replace F{Fi) by Uees' F{Ke). We obtain again a forest of G with 

the same number of edges. (Indeed, first in F” we replace U/ZiGg' F{Fi) by 
UeeB' F{F'e) and obviously we obtained a graph with c connected components, 
and, clearly, the edge set of this graph corresponds to a subgraph of G with c 
connected components. Since the number of edges in this subgraph is the same 
as in E' U E{F') it is a forest of the same size.) The forest obtained consists of 
v-pairs, that is it is a cactus with size n — l + X^g^gg J ■ 



Remark 5. While I was writing the final version of this paper I realized that the 
same proof (after the natural changes) works for the general graphic matroid 
parity problem. The details will be given in a forthcoming paper [8]. 



Acknowledgement. I am very grateful to Gabor Bacso for the fruitful discus- 
sions on the topic. 
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Abstract. Let G — (V,E) be a fc-edge-connected mnltigraph with a 
designated vertex s € U which has even degree. A splitting operation 
at s replaces two edges {s, u) and (s, v) incident to s with a single edge 
(u,v). A set of splitting operations at s is called complete if there is no 
edge incident to s in the resulting graph. It is known by Lovasz (1979) 
that there always exists a complete splitting at s such that the resulting 
graph G' (neglecting the isolated vertex s) remains fc-edge-connected. In 
this paper, we prove that, in the case where G is planar and k is an even 
integer or k — 3, there exists a complete splitting at s such that the 
resulting graph G' remains fc-edge-connected and planar, and present an 
0(|U|® log |U|) time algorithm for finding such a splitting. However, for 
every odd k > 5, there is a planar graph G with a vertex s which has 
no complete splitting at s which preserves both fc-edge-connectivity and 
planarity. As an application of this result, we show that the problem 
of augmenting the edge-connectivity of a given outerplanar graph to an 
even integer fc or to fc = 3 can be solved in polynomial time. 



1 Introduction 

Let G = (y, E) stand for an undirected multigraph, where an edge with end 
vertices u and v is denoted by (u,v). For a subset^ S' C U in G, G[S] denotes 
the subgraph induced by S. For two disjoint subsets X,Y C V, we denote by 
Ea{X,Y) the set of edges {u,v) with u G X and v G Y, and by cg{X,Y) 
the number of edges in Eg{X,Y). The set of edges Eg{u,v) may alternatively 
be represented by a single link (u,v) with multiplicity cg{u,v). In this way, 
we also represent a multigraph G = (V,E) by an edge- weighted simple graph 
N = (y, Lg, cg) (called a network) with a set V of vertices and a set Lg of links 
weighted by cg '■ Lg ^ where is the set of non-negative integers. We 

denote \V\ by n, \E\ by e and \Lg\ by m. A cut is defined as a subset A of y 

^ A singleton set {x} may be simply written as x, and “ C ” implies proper inclusion 
while “ C ” means “ C ” or “ = ” . 

R. E. Bixby, E. A. Boyd, and R. Z. Rfos-Mercado (Eds.): IPCO VI 
LNCS 1412, pp. 96-111, 1998. © Springer-Verlag Berlin Heidelberg 1998 
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with 0 ^ X ^ y, and the size of the cut X is defined by cc{X, V — X), which 
may also be written as cg{X). If X = {a;}, cg{x) denotes the degree of vertex x. 
For a subset X C V, define its inner- connectivity by Ag(X) = min{cG(X') | 0 ^ 
X' C X}. In particular, AG(y) (i-C., the size of a minimum cut in G) is called 
the edge- connectivity of G. For a vertex v & V , a, vertex u adjacent to v is called 
a neighbor of v in G. Let Faiv) denote the set of neighbors of v in G. 

Let s G y be a designated vertex in V . A cut X is called s-proper if 0 yf 
X C V — s. The size Xg{V — s) of a minimum s-proper cut is called the s- 
based-connectivity of G. Hence AG(y) = min{AG(y — s),cg(s)}- A splitting at 
s is (A:, s)-feasible if AG'(y — s) > k holds for the resulting graph G' . Lovasz [6] 
showed the following important property: 

Theorem 1 ([2,6]). Let a multigraph G = (V,E) have a designated vertex 
s G y with even cg(s), and k be an integer with 2 <k < XGiV — s). Then there 
is a complete {k, s)- feasible splitting. □ 

Since a complete (fc, s) -feasible splitting effectively reduces the number of ver- 
tices in a graph while preserving its s-based-connectivity, it plays an important 
role in solving many graph connectivity problems (e.g., see [1,2,9]). 

In this paper, we prove an extension of Lovasz ’s edge-splitting theorem, aim- 
ing to solve the edge-connectivity augmentation problem with an additional con- 
straint that preserves the planarity of a given planar graph. Firstly, we consider 
the following type of splitting; for a multigraph G = (V, E) with a designated 
vertex s, let Eg{s) = {wq, wi, . . . , Wp_i} {p = |/g(s)|) of neighbors of s, and 
assume that a cyclic order tt = {wq,wi, . . . ,Wp-\) of Eg{s) is given. We say 
that two edges ci = (wh,Wi) and 62 = (wj,Wi) are crossing (with respect to tt) 
if ei and 62 are not adjacent and the four end vertices appear in the order of 
Wh, WjjWi, Wi along tt (i.e., h-\-a = j-\-b = i-\-c = i (mod p) holds for some 
l<c< 6 <a<p— 1).A sequence of splittings at s is called noncrossing 
if no two split edges resulting from the sequence are crossing. We prove that 
there always exists a complete and noncrossing {k, s)-feasible splitting for even 
integers k, and such a splitting can be found in 0{n^{m -\- nlogn)) time. 

Next we consider a planar multigraph G = (V, E) with a vertex s G y of even 
degree. A complete splitting at s is called planarity-preserving if the resulting 
graph from the splitting remains planar. Based on the result of noncrossing 
splitting, we prove that, if k is an even integer with k < AG(y — s), then there 
always exists a complete (fc, s)-feasible and planarity-preserving splitting, and 
the splitting can be found in 0{n^ logn) time. For k = 3, we prove by a separate 
argument that there exists a complete (fc, s)-feasible and planarity-preserving 
splitting if the resulting graph is allowed to be re-embedded in the plane. 

Example 1. (a) Fig. 1(a) shows a graph Gi = (V,E) with CGi{s,Wi) = 1 and 
CGi(wi,Wi+i) = a, 0 < t < 3 for a given integer a > 1. Clearly, AGi(y — s) = 
k for k = 2a -I- 1. For a cyclic order tt = {wq,wi,W 2,W3 ), Gi has a unique 
complete (A:, s)-feasible splitting (i.e., splitting pair of (s,wo), (s,W2) and a pair 
of (s, wi), (s, W3)), which is crossing with respect to tt. This implies that, for every 
odd k > 3, there is a graph G with a designated vertex s and a cyclic order of 
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Fig. 1. Example of three planar graphs (a) G\, (b) G 2 , and (c) G 3 . 



/g(s) which has no complete and noncrossing (fc, s)-feasible splitting. Note that 
the planar Gi has a complete and planarity-preserving (fc, s)-feasible splitting (by 
putting one of the split edges in the inner area of cycle Gi = {W 07 wi, W 2 , W 3 }). 

(b) Fig. 1(b) shows a planar graph G 2 = (V,E) with CG 2 {wi,Wi+i) = a (mod 
12 ) for 0 < z < 11 and 0 ^ 2 ( 6 ) = 1 otherwise for an integer a > 1, which satisfies 
Ag 2 (^ — s) = k for k = 2a + 1. The G 2 has a unique complete {k, s)-splitting, 
which is not planarity-preserving unless the embedding of subgraph G 2 [V — s] is 
not changed; if G 2 [V — s] is re-embedded in the plane so that block components 
{w 2 , UI 3 , W 4 } and {zcs, wg, z«io} of G 2 [V — s] are flipped and two vertices W 3 and 
wg share the same inner face, then the complete (k, s)-splitting is now planarity- 
preserving. From this, we see that for every odd k > 3, there is a planar graph 
G with a designated vertex s which has no complete and planarity-preserving 
(A:, s)-feasible splitting (unless the embedding of G is re-embedded). 

(c) Let a > 2 be an integer, and consider the graph G 3 = (V,E) in Fig. 1(c), 

where cgj ( zcj, zci+i) = a for z G {1,7}, CG^iwijWi+i) = a (mod 12 ) for z G 
{0, 1, . . . , 11} — {1,7}, and ca^ie) = 1 otherwise. Clearly, Ag 3 (F — s) = k for 
A: = 2o -I- 1 (> 5). It is easily observed that the unique complete (A:, s)-feasible 
splitting is not planarity-preserving for any choice of re-embedding of G 3 in the 
plane. This implies that for every odd k > 5, there exists a graph which has no 
complete and planarity-preserving (k, s)-feasible splitting even if re-embedding 
after splitting is allowed. □ 

2 Preliminaries 

2.1 Computing s-Based Connectivity 

The vertex set F of a multigraph G = {V, E) are denoted by V (G). We say that 
a cut X separates two disjoint subsets Y and Y' of V if Y C X C V — Y' {or 
Y' C X CV — Y ). The local edge-connectivity \g{x, y) for two vertices x,y G V 
is defined to be the minimum size of a cut in G that separates x and y. A cut X 
crosses another cut Y if none of subsets X DY, X — Y, Y — X and V — {X U F) 
is empty. 
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An ordering vi,V 2 , ■ ■ ■ ,Vn of all vertices in V is called a maximum adjacency 
(MA) ordering in G if it satisfies cg{{v\,V 2 , ■ • ■ , fi}> t’i+i) > cg{{vi,V 2 , • ■ • , Vi}, 
Vj), 1 < i < j < n. 

Lemma 1. [8] Let G = {V, E) be a multigraph, and v\ he a vertex in V . 

(i) An MA ordering V\,V 2 , ■■■ ,Vn of vertices in G can be found in 0{m + n log n) 
time. 

(ii) The last two vertices Vn-i and satisfy Xcivn-ijUn) = CG(vn)- □ 

Using this lemma repeatedly, we can compute Xg(V — s) by the next algo- 
rithm. 

Algorithm CONTRACT 

Input: A multigraph G = (V,E) with |U| > 3 and a designated vertex s € V. 
Output: an s-proper cut X* with cg(X*) = Xg(V — s) < Xg(X*). 

1 a := min{cG(v) | u € U — s}; 

2 Let X := {?;} for a vertex v G V — s with cg(v) = a; 

3 H :=G; 

4 while \V{E[)\ > 4 do { ch{u) > a holds for all u G V{H) — s } 

5 Find an MA-ordering in El starting from v\ = s, and let v,w (yf s) be the 
last two vertices in this ordering; { Xh{v,w) = ch{w) by Lemma l(ii) } 

6 Contract v and w into a vertex, say z, and let H be the resulting graph; 

7 if ch{z) < a then 

8 Let X* be the set of all vertices in U — s contracted into 2 so far; 

{ ch{z) = cg{X*) } 

9 end { if } 

10 end. { while } 



It should be noted that for each u G V(H) — s ch{u) >a holds before every 
iteration of the while-loop. The last two vertices u, w in an MA ordering in line 5, 
which are clearly distinct from s, satisfy Xh{v,w) = ch{w) by Lemma l(ii). 
Let X* be the cut output by CONTRACT, and a* be the final value of a 
(i.e., a* = cg{X*)). Note that any two vertices v and w in line 5 have been 
contracted into a single vertex only when Xh{v,w) > a* holds. We prove that 
a* = Xg{V — s). For a vertex u G V(H) — s, let A„ denote the set of all 
vertices in U — s contracted so far. Assume that there is an s-proper cut V with 
cg(U) < a*. Clearly, the final graph E[ has three vertices z\, Z 2 and s, and 
satisfies a* > min{c//(zi), Ci/(z 2 )}, and thus Y ^ Hence there is a 

vertex pair v,w G V (El) chosen in line 5 at some iteration of the while-loop such 
that A„ C A and A^ C (U - s) - A (or A^„ C A and A„ C (A - s) - A). 
Assume that v and w are the vertices in the earliest iteration of the while-loops 
among such pairs of vertices. This implies that when v and w are contracted 
into a vertex, the current graph E[ has a subset A' C V{H) — s such that 
Uy^Y'Xy = A. However, Xh{v,w) < ch{Y') = cg{Y) < a*, contradicting 
Xh{v,w) > a*. Therefore, a* = Xg{V — s). 
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Now we show Ag(-^*) > a*. Assume that the output cut X* is found in 
line 8 in the z-th iteration of the while-loop (the case where the X* is found 
at line 3 is trivial), and let a' be the value of a before the z-th iteration. Then 
a' > a*, and any two vertices v and w in line 5 in any earlier iteration have 
been contracted into a single vertex only when \h{v, w) > a' . Analogously with 
the above argument, we see that when CONTRACT finds the final cut X* in 
line 8, Xa{u,u') > o! holds for u,v! G X* (hence Ag(A1*) > a*). This leads to 
the next lemma, where the running time clearly follows from Lemma l(i). 

Lemma 2. For a multigraph G = (V,E) with a designated vertex s G C, CON- 
TRACT computes a cut X* such that cg{X*) = Xg(V — s) < Xg{X*) in 
0{n{m + nlogn)) time. □ 



2.2 Splitting Edges for a Pair of Neighbors 

Given a multigraph G = (V,E), a, designated vertex s G R, vertices u,v G Eg{s) 
(possibly u = v) and a non-negative integer S < min{cG(s, zz), cg(s, w)}, we 
construct graph G' = {V,E') from G by deleting S edges from Eg{s,u) and 
Eg{s,u), respectively, and adding new 6 edges to Eg{u,v). We say that G' is 
obtained from G by splitting S pairs of edges (s, u) and (s, v) by size S, and 
denote the resulting graph G' by G/{u, v, 6). Clearly, for any s-proper cut A, we 
see that 



(Y\-j cg(A) - 2(5 if zz, z; G A 

CG/{u,v,5)(-^) |cg(A) Otherwise. 



( 1 ) 



Given an integer k satisfying 0 < /c < Ag(R — s), we say that splitting <5 pairs 
of edges (s,zz) and (s,z;) is {k, s)- feasible if Ag/(«,i,, 5 )(R — s) > k. 

For an integer k, let Ag{u, v, k) be the maximum S such that splitting edges 
(s,zz) and (s,v) with size 6 is (A:, s)-feasible in G. In this subsection, we show 
how to compute AG{u,v,k). An s-proper cut A is called {k, s)- semi- critical in 
G if it satisfies cg(s. A) > 0, A: < cg(A) < A; -|- 1 and Ag(A) > k. 

An algorithm, called MAXSPLIT(zz, u, A:), for computing AG{u,v,k) is de- 
scribed as follows. 

1. Let 6max = min{cG(s, u), cg{s, z;)} if zz yf u, and 6max = Lcg(s, zz)/2j iiu = v, 
and let Grnax — G / {UjVjdmax)- 

2. Compute Ag„„, (R - s) and an s-proper cut A with cg„„, (A) = Ag„„, (R - 
s) < ^Grr^axi^) (such A cxists by Lemma 2). If Ag„„„(R - s) > k, then 
Ag{u,v, k) = Smax, where at least one of u and v is no longer a neighbor of 
s in Gmax in the case u ^ v, or CG^ax (S) zz) < 1 in the case u = v. 

3. If A: - XG^ax - s) > then zz, z; G A (for otherwise cg(A) = cg„„„ {X) < k 
would hold). Output AG{u,v,k) = Smax—\^{k—XGmax(^~^))^ and the s-proper 
cut A as such a {k, s)-semi-critical cut with zz, z; G A. 

The correctness of step 2 is clear. In step 3, we see from (1) that G' = 
G/ (zz, V, 5) with 6 

— ^max [^(fc - AG„a,(R - s))l satisfies Ag'(R - s) = A: or 
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k + 1. This implies that Ac{u,v,k) = 6. We show that the X has a property 
that 

co'{Z) > co'{X) for any Z with u,v G Z C X , 

where we call such a (fc, s)-semi-critical cut X with u,v G X admissible (with 
respect to u,v) in G' . For any Z with u,v G Z C AT, we have cg'{Z) = 
CGrr^a. (Z) + 2\^{k- ~ s) )1 >CG'(X), siuce (X ) > Ag„,, (V" - s) 

implies CG^axi^) > '^Gmaxi^)- By summarizing this, we have the next result. 

Lemma 3. For a multigraph G = (V, E) with a designated vertex s G V , and 
vertices u,v G Fg{s) {possibly u = v), let k be a nonnegative integer with k < 
^g{V — s), and let G' = Gj {u, v, 6) for 6 = Ag{u, v, k). Then: 

(i) If CG'{s,u) > 0 and cg'{s,v) > 0 in the case u ^ v or if cg>{s,u) >2 in the 
case u = v, then G' has an admissible cut X . 

(ii) The cut X in {i) {if any) and AG{u,v,k) can be computed in 0{mn+n^logn) 

time. □ 

3 Noncrossing Edge Splitting 

For a cyclic order tt = {wq,wi, . . . ,Wp-i) of Tg{s), a sequence of splittings at 
s is called noncrossing (with respect to tt) if no two split edges {wh,Wi) and 
{wj,Wi) are crossing with respect to tt (see Section 1 for the definition). In this 
section, we show that for any even k < Xg{V — s), there always exists a complete 
and noncrossing (fc, s)-splitting. However, as observed in Example 1(a), for every 
odd k >3, there is a graph that has no such splitting. 



3.1 (fc, s)-Semi-critical Collections 

Before computing a complete {k, s)-feasible splitting, we first find a family X 
of subsets oi V — s (by performing some noncrossing edge splittings at s) as 
follows. For a multigraph G = (E, E) and s gV, & family T = {Xi,X 2 , ■ ■ ■ , Xr} 
of disjoint subsets Al^ C E — s is called a collection in E — s. A collection fb 
may be empty. A collection X is called covering if <^G{s,Xi) = cg{s). A 
collection A in E — s is called {k, s)- semi- critical in G either if A = 0 or if 
all Xi G X are {k, s)-semi-critical. We can easily see that a {k, s)-semi-critical 
covering collection in G with cg(s) > 0 satisfies |A| > 2 [9]. 

Let X be an s-proper cut with cg{X) < fc + 1. Clearly, splitting two edges 
(s, u) and (s, v) such that u, G A is not {k, s)-feasible. Then the size of any cut 
Z C X remains unchanged after any (fc, s)-feasible splitting in G. We say that 
two s-proper cuts X and Y s-cross each other if X and Y cross each other and 
cg(s, A n F) > 0. It is not difficult to prove the following properties by using 
submodularity of cut function cg (the detail is omitted). 

Lemma 4. Let G = {V, E) be a multigraph with a designated vertex s, and k be 
an integer with k < \g{V — s) . Then: 




102 Hiroshi Nagamochi and Peter Eades 



(i) If two {k, s)- semi- critical cuts X and Y s-cross each other, then cc{X) = 
cg(Y) = k+l, cg{X -Y) = cg(Y -X) = k and cg{X n F, H - (X U F)) = 1. 

(ii) Let X he an admissible cut with respect to u,u' G V — s (possibly u = u'), 
and Y he a (k, s)- semi- critical cut. If X and Y cross each other, then cg(X) = 
cg(Y) = k -\-1 and cg(Y — X) = k. 

(iii) Let Xi (resp., Xj) he admissible cuts with respect to Ui, u( (resp., with respect 

to Uj,Uj), where possibly Ui = u( or Uj = m' holds. If {ui,u(} n {uj,u'I\ = 0 or 
cg(s,u) > 2 for some u G {ui,u(} n {uj,u'j}, then two cuts Xi and Xj do not 
cross each other. □ 

We now describe an algorithm, called COLLECTION, which computes a 
(k, s)-semi-critical covering collection T in a graph G* obtained from G by a 
noncrossing sequence of (k, s)-feasible splittings. Let tt = (wq, wi, . . . , Wp-i) be 
a cyclic order of / g(s). In a graph G' obtained from G by a noncrossing sequence 
of (k, s)-feasible splittings, a vertex wj is called a successor of a vertex Wi if 
CG'(s, Wj) > 1 and h > 0 with j = i -\- h (mod p) is minimum. 

0. Initialize X to be 0. 

1. for each Wi, i := 0, . . . ,p — 1 do 

if Wi is not in any cut X G X then execute MAXSPLIT(wi, Wj, /c) to compute 
G' = G/(wi,Wi,S) with S = AG(wi,Wi, k) and an admissible cut in G' (if 
CG'(s,Wi) > 2); let G := G'; 

if CG(s,Wi) > 2 then X := X U {Xi„^}, discarding all X G X with X C X^,, 
from X . 
end { for } 

2. for each Wi such that cg(s, Wi) = 1, z := 0, . . . ,p — 1 do 

if Wi is not in any cut X G X then execute MAXSPLIT(zz;i, fc) for Wi and 
the successor Wj of s in the current G to compute G' = G/(wi,Wj,S) with 
S = AG(wi,Wj,k) and an admissible cut in G' (if CG'(s,Wi) = 1); let 
G := G'; 

if cg(s, Wi) = 1 then A := {X — | cg(s, X — X^fj >0, X G X}U {Xj„.}. 

else (if cg(s, Wi) = 0) remove any cut X with cg(s, A) = 0 from X. 
end { for } 

Output G* := G. □ 

Clearly, the resulting sequence of splitting is (k, s)-feasible and noncrossing. 

Lemma 5. Algorithm COLLECTION correctly computes a (k, s)- semi- critical 
covering collection X in the output graph G* . 

Proof: Let X be the set of cuts obtained after step I . If two cuts Ai„ . , X^j G X 
(0 < z < j < p — 1) has a common vertex v, then wj ^ Ai„. and X^^ — X^^ 0 

(otherwise, Ai„. must have been discarded). However, this implies that A^j^ and 
X.^jj cross each other, contradicting Lemma 4(iii). Thus, the A is a (/c, s)-semi- 
critical collection. 

Now we prove by induction that A is a (k, s)-semi-critical collection dur- 
ing step 2. Assume that MAXSPLIT(zci, zcj, fc) is executed to compute G' = 
G/(wi,Wj,S) with S = AG(wi,Wj,k). If CG'(s,Wi) = 0, then a cut X G X with 
Wj G X may satisfy cg'(s,X) = 0 after the splitting. However, such a cut will 
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be removed from X. If CG'{s,Wi) = 1, then an admissible cut in G' is 
found. Clearly, any X G X satisfies one of the followings: (i) X n X^^ = 0, 
(ii) X C X^i, and (iii) X n X^. ^ X — Since df is updated to 

{X — Xtui I cg{s, X — Xim) >0, X G X} U it is sufficient to show that 

CG'{X — = k holds in the case (iii) (note that Xg'{X — X^m) > k follows 

from ^ k). Since two cuts X and cross each other in the case (iii), 

CG'{X — ATujJ = k follows from Lemma 4(ii). This proves that X remains to be 
a (fc, s)-semi-critical collection, which becomes covering after step 2. □ 

3.2 Algorithm for Noncrossing Edge-Splitting 

In this subsection, k is assumed to be a positive even integer. We can prove the 
next property by Lemma 4(i) and the evenness of k (the detail is omitted). 

Lemma 6. Let G = (V) E) he a multigraph with a designated vertex s, and k he 
an even integer with k < Xg{V — s). Further, let X he a (fc, s)- semi- critical cut, 
and Y and Y' he {k, s)- semi- critical cuts with Y C\Y' = %. Then X can s-cross 
at most one ofY and Y' . □ 

Using the lemma, we now describe an algorithm that constructs a complete 
and noncrossing (fc, s)-feasible splitting from a given (fc, s)-semi-critical covering 
collection df in a graph G with a designated vertex s of even cg(s). 

If s has at most three neighbors, then any complete (fc, s)-feasible splitting 
is noncrossing (with respect to any cyclic order of /^(s)) and such a splitting 
can be found by applying MAXSPLIT at most three times. In what follows, we 
assume that |Tg(s)| > 4. 

First, we define a notion of segment. For a given covering collection X with 
|df| > 2 in a multigraph G with a designated vertex s and a cyclic order tt = 
(wq, wi, . . . , Wp-i) of Fg{s), a subset P C Fg{s) of neighbors of s which are 
consecutive in the cyclic order is called segment if there is a cut X G X with 
PCX such that P is maximal subject to this property. Note that there may be 
two segments P and P' with P U P' C A for the same cut X G X. A segment 
P with |P| = 1 is called trivial. We now describe the two cases. 

Case 1: There is a nontrivial segment P = {wi, Wi+i, . . . , Wj} (with respect 
to X). Let Xi, X 2 and A3 be the cuts in df such that Wi-i G Ai, {wi, . . . , wj} C 
X 2 and Wj+i G A3 (possibly Ai = A3). We execute MAXSPLIT(wi_i, Wj, A:) 
and then MAXSPLIT(rt;j, Wj+i, A:). Let G" be the resulting graph. We first re- 
move all cuts X G X with cg"(s,A) = 0 from X. If one of Wi-i,Wi,wj,Wj+i 
is no longer a neighbor of s in G", then the number of neighbors of s decreases 
at least by one. Let us consider the case where all of Wi-i,Wi,Wj,Wj+i remain 
neighbors of s in G". Thus, G" has admissible cuts Yi and Yj (with respect 
to Wi-i,Wi and Wj,Wj+i, respectively). In the case of Wi-i = Wj+i, it holds 
CG"{s,Wi-i) > 2, since otherwise cg"(s, X 2) > 3 (by evenness of cg”(s)) would 
imply CG"(y - X 2 ) = CG"(X2) - cg"{s,X 2 ) -\- cg"{s,Xi) < {k -\- 1) - 3 -\- 1 < k, 
contradicting Ag"(^ — s) > k. Thus, by Lemma 4(iii), two cuts Yi and Yj do 
not cross each other. There are two subcases (a) Fj n 1} = 0 and (b) Yi C Yj or 
Y, c y,. 
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(a) YiHYj = 0. We prove that Xi ^ X3 and XiU X2U X3 CYiLlYj. Then 

two (fc, s)-semi-critical cuts Yi and X2 s-cross each other, and by Lemma 4(i) 
CG"{X2 — Yi) = CG"{Yi — X2) = k. Note that X2 — li is a {k, s)-semi-critical cut. 
Thus X2~Yi cannot cross another admissible cut Yj (otherwise CG"{X2 — Yi) = k 
would contradict Lemma 4(ii)), and hence X2 C YiUYj. By Lemma 6, Yi which 
already crosses X2 cannot s-cross Xi, and thus Xi C Yi. Similarly, we have 
X3 C Yj. Therefore, Xi yf X^ and X\ U X2 U X^ C U Yj. There may be some 
cut X G X — lXi, X2, X3} which crosses one of Yi or Yj. If the X crosses Yi, then 
CG"{X — Yi) = k by Lemma 4(ii). We see that cg"(s,X — Yi) = cg"(s,X) > 1, 
because if cg"{s,X n Yj) > 1 (i.e., X and Yi s-cross each other) then cg"(^ ~ 
X) = k < k + I = CG”{Yi) by Lemma 4(i), contradicting the admissibility of 
Yi (note {wi-i,Wi} CYi — X). Thus X — Yi is a, (fc, s)-semi-critical cut in G” . 
Similarly, If the X crosses Yj, then X — Yj is a (fc, s)-semi-critical cut in G" . 
Hence ii X (^Yi ^ ^ X C] Yj, then X — Yi — Yj is also a (fc, s)-semi-critical cut 

in G" . Therefore, we can update T by T := {X — Yi — Yj \ X G X} U {Yi, Yj}. 

(b) Yi C Yj or Yj C Yi. Without loss of generality, assume Y, C Yi. Since 
CG"{s,X2 r\ Yi) > 2 holds by Wi,Wj G X2 H Yi, we see by Lemma 4(i) that 
X2 cannot cross Yi (hence, X2 C Yi). If Xi = X3, then cg"{s,Xi C\Yi) > 2 
(including the case Wi-i = Wj+i) and by Lemma 4(i) Xi = X3 cannot cross Yi 
(hence, X\ = X^ G Yi). In the case X\ yf X^, at most one of Xi and X3 can 
s-cross Yi by Lemma 6. Thus Xi C Yi or X^ C Yi. For any cut X G X — {X3} 
which crosses Yi, we can show that X — Yi is a (A:, s)-semi-critical cut in G” 
using similar reasoning as for Case 1(a). Therefore, we can update T by T := 
{X -Y, I CG"(s,X - Yi) > 0 , X G X}U [Yi] (note that cg"(s,Xi - Y,) or 
CG"{s, Xz - Y{) may be 0 ). 

Notice that the number of cuts in X in cases (a) and (b) decreases at least 
by one after updating. 

Case 2: All segments are trivial. We choose a neighbor Wi of s and its succes- 
sor Wj, and execute MAXSPLIT(wi, Wj,k). Assume that MAXSPLIT(wi, Wj,k) 
finds an admissible cut Y (otherwise, the number of neighbors of s decreases 
at least by one). Let Xi and X2 be the cuts in X which contain Wi and Wj, 
respectively. We see that Ai C Y or A2 C Y, because otherwise Y would s-cross 
both Xi and A2 (contradicting Lemma 6). If X\ s-crosses Y, then we see by 
Lemma 4(ii) that Ai — Y is a (A:, s)-semi-critical cut if cg"{s,Xi — Y) > 1. 
The case where A2 s-crosses Y is similar. For any cut X G X — {Ai, A2} which 
crosses Y, we can show that A — Y is a (A;, s)-semi-critical cut in G" using similar 
reasoning as for Case 1(a). We update A by A := {A — Y | cg(s,X — Y) > 
1 , AgA}U{Y}. In this case, the number of cuts in A never increases, but it 
may not decrease, either. However, Y contains a nontrivial segment in the new 
A, and we can apply the above argument of Case 1. 

By applying the above argument to Case 1, at least one vertex is no longer a 
neighbor of s or the number of cuts in a collection A is decreased at least by one. 
After applying the argument of Case 2, at least one vertex is no longer a neighbor 
of s or a nontrivial segment is created. Therefore, by executing MAXSPLIT at 
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most 4(|r'G(s)| + |A’|) = 0(|rG(s)|) times, we can obtain a complete {k, s)-feasible 
splitting of a given graph G, which is obviously noncrossing. 

Theorem 2. Given a multigraph G = (V,E) with a designated vertex s & V 
of even degree, a positive even integer k < Xa{V — s), and a cyclic order tt of 
neighbors of s, a complete and noncrossing {k, s)- feasible splitting can be found 
in 0{\ra{s)\n{m + nlogn)) time. □ 

4 Planarity-Preserving Splitting 

In this section, we assume that a given graph G with a designated vertex s of 
even degree and an integer k < Xg(V — s) is planar, and consider whether there 
is a complete and planarity-preserving (fc, s)-feasible splitting. We prove that 
such splitting always exists if k is even or k = 3, but may not exist if k is odd 
and fc > 5, as observed in Example 1(c). We initially fix an embedding -tp of G 
in the plane, and let tt,/, be the order of neighbors of s that appear around s in 
the embedding y of G. 



4.1 The Case of Even k 

Clearly, a complete splitting at s is planarity-preserving if it is noncrossing with 
respect to tt^. Therefore, in the case of even integers k, the next theorem is 
immediate from Theorem 2 and the fact that m is 0(n) in a planar graph G. 

Theorem 3. Given a planar multigraph G = (V, E) with a designated vertex 
s G V of even degree, and a positive even integer k < Xg{V — s), there exists a 
complete and planarity-preserving {k, s)- feasible splitting {which also preserves 
the embedding of G[V — s] in the plane), and such splitting can be found in 
0(|/G(s)|n^ logn) time. □ 



4.2 The Case of fe = 3 

For fc = 3 < Xg{V — s), we can prove that there is a complete and planarity- 
preserving (fc, s)-feasible splitting. However, as observed in Example 1(b), in 
this case we may need to re-embed the subgraph G\V — s] in the plane to ob- 
tain such a splitting. We will show how to obtain a complete (3, s)-feasible and 
planarity-preserving splitting. Firstly, however, we describe a preprocessing al- 
gorithm based on the following Lemma 7 (the proof is omitted) . 

A set Y of vertices in G (or the induced graph G[T]) is called a k-component 
if Xg{u,v) > k for all u,v G Y and |T| is maximal subject to this property. 
It is easy to see that the set of fc-components in a graph is unique, and forms 
a partition of the vertex set. Such a partition can be found in linear time for 
fc = 1, 2 [10] and for /c = 3 [7,12]. A /c-component Y (or the induced graph G[T]) 
is called a leaf k-component if cg{Y) < k. Note that A( 3 [y_s](E — s) means the 
edge-connectivity of the subgraph G\V — s]. 
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Lemma 7. Let G = (V, E) be a multigraph with a designated vertex s of even 
degree such that — s) > 3. 

(i) Any s-proper cut X with ca{X) < 4 induces a connected subgraph G[AT]. 

(ii) Assume A( 3 [y_s] — s) = 0, and let u and v be two neighbors of s such that 

they belong to different 1-components in G[V — s]. Then Aa{u,v,3) > 1. 

(iii) Assume AG[y_sj(y — s) = 1. Let Y be a leaf 2-component in G\V — s]. Lf 
Aa{u,v,3) = 0 for some neighbors u G Lb(s) n Y and v G 7g(s) — Y, then 
Tg{s) — Y — v^% and Ag(u', v',3) > 1 for any neighbors v' G 7g(s) —Y — v 
and u' G Tg{s) n Y . 

(iv) Assume Ac[y_s](y — s) = 2. Let Y be a 3-component with cg{s,Y) > 2 
in G\V — s]. Then Ag{u,v,'A) > 1 for any neighbors u G Tg{s) H Y and 

V G Tg{s) - Y. 

(v) Assume AG[y-s](^— s) = 2. LetY be a non-leaf 3-component with cg{s,Y) = 
1 in G\V — s]. Lf Ag{u,v,3) = 0 for some neighbors u G Tg(s) n Y and 

V G Tg(s) — Y, then rG{s) — Y — v 0 and Ag{u,v' ,3) > 1 for any neighbor 

v' G Tg(s) —Y — v. □ 

Based on this lemma, for a given cyclic order tt of Tg{s), we can find a 
noncrossing sequence of {k, s)-feasible splittings (which may not be complete) 
such that the resulting graph G* satisfies either cg*(s) = 0 (i.e., the obtained 
splitting is complete) or the following condition: 



A( 3 .[y_ 5 ](y — s) = 2, and cg*(s,Y) = 1 for all leaf 3-components Y, 
and CG* (s, Y') = 0 for all non-leaf 3-components Y' in G* [H — s]. 



( 2 ) 



The entire preprocessing is described as follows. 



Algorithm PREPROCESS 

Input: A multigraph G = (V, E) (which is not necessarily planar), a designated 
vertex s G V with even degree and Ag(E — s) > 3, and a cyclic order tt of Ig(s)- 
Output: A multigraph G* obtained from G by a noncrossing (3, s)-feasible 
splitting such that G* satisfies either cg*(s) = 0 or (2). 

1 G' := G; 

2 while G' [V — s] is not connected do 

3 Find a neighbor w G Eg'{s) and its successor w' G T'g'(s) such that 
w and w' belong to different 1-components in G' [E — s] ; 

4 G' := G'/(w,w',l) 

5 end; { while } 

6 while G'\V — s] is not 2-edge-connected (i.e., AG'[y-s](E — s) = 1) do 

7 Choose a leaf 2-component Y in G' [E — s] ; 

8 Find a neighbor w G Eg'{s) C Y and its successor w' G Tg'{s) — Y; 

9 if Ag(w, w', 3) > 1 then G' := G' /{w, w', 1) 

10 else 

11 Find a neighbor w” G Tg'{s) — Y and its successor w'” G Eg'{s) C Y, 
and G' := G' / {w" , w'" , 1) 

end { if } 



12 
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13 end; { while } 

14 while G'\V — s] has a 3-component Y with cg'{s, F) > 2 or a non-leaf 

3-component Y with cg'{s,Y) = 1 do 

15 Find neighbors w G Yds) n Y and w' G Yds) — Y such that w' is 
the successor of w, 

16 if Ag{w, w', 3) > 1 then G' := G' /{w, w', 1) 

17 else { F is a non-leaf 3-component with cds,Y) = 1 } 

18 Find a neighbor w” G Yds) — Y such that w is the successor of w” , 
and G" := G'/{w",w,l) 

19 end { if } 

20 end; { while } 

21 Output G* := G'. 



Correctness of PREPROCESS easily follows from Lemma 7. Clearly, the 
number of splitting operations carried out in PREPROCESS is 0{n), and each 
splitting (including testing if Ag{w,w' ,3) > 1 in lines 9 and 16) can be per- 
formed in linear time. Therefore, PREPROCESS runs in O(n^) time. 

Lemma 8. Yor a multigraph G = (V,E)> o, designated vertex s G F with an even 
degree and Xg{V — s) > 3, and a cyclic order tt of Yg{s), there is a noncrossing 
sequence o/(3, s)-fe<^sible splitting such that the resulting graph G* satisfies either 
CG*(s) = 0 or (2), and such a splitting can be found in O(n^) time. □ 

Now we describe how to obtain a complete (3, s)-feasible and planarity- 
preserving splitting given (2). 

Let G* = (V,E*) be a multigraph satisfying (2), and let G” = {V",E") 
denote the graph G*\V — s]. Then \G"{y") = 2. A cut in a graph is a 2-cut 
if its cut size is 2. Now we consider a representation of all 2-cuts in G" . Let 
{Yx,Y 2 t ■ ■ dr} be the set of 3-components in G" , and Qg" = (V,£) denote 
the graph obtained from G" by contracting each 3-component F into a single 
vertex yi. For a vertex v € V, <PG"(v) denotes the vertex j/i G V into which 
V (possibly together with other vertices) is contracted. We can easily observe 
that Qg" is a cactus, i.e., a connected graph such that any edge is contained in 
exactly one cycle, where a cycle may be of length two. Clearly a cactus is an 
outerplanar graph, and any two cycles in a cactus have at most one common 
vertex. A vertex with degree 2 in a cactus is called a leaf vertex. Since any non- 
leaf vertex in a cactus is a cut-vertex, there may be many ways of embedding a 
cactus in the plane. For a vertex z G V in the cactus Qg'g we denote by 
the 3-component Yi such that F is contracted into z. It is easy to see that a 
subset F C F in G” is a leaf 3-component if and only if there is a leaf vertex 
z G V with (z) = F in Qg" . From this, we have the next lemma. 

Lemma 9. Let G* = (F, E*) be a multigraph, and s be a designated vertex with 
even degree. Assume that Ag*(F — s) > 3 and (2) holds. Let L{Qg") be the set 
of leaf vertices in cactus Qg" of G” = G*[F — s]. Then ipG" defines a bijection 
between Yg{s) and L{Qg>>). □ 
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Now assume that a given graph G = {V, E) with — s) > 3 is planar. 

Fix a planar embedding ip of G, and let tt^ be the cyclic order of Eg{s) in which 
neighbors in Ecis) appear along the outer face of G[H — s]. By applying PRE- 
PROCESS to G and a multigraph G* is obtained from G by a noncrossing 
(hence planarity-preserving) (3, s)-feasible splitting satisfying either cg*(s) = 0 
or (2). If CG*{s) = 0 (i.e., the splitting is complete) then we are done. Thus 
assume that G* satisfies (2). It is not difficult to see that G" = G*\V — s] and 
cactus Qg" have the following properties: 

(i) For any 2-cut X in G", Z = {Jx£XVG"{x) is a 2-cut in Qg"- (i-C., the two 
edges in Eg"{X) exist in the same cycle in Qg")- 

(ii) For any 2-cut Z in Qg", X = is a 2-cut in G". 

In other words, cactus Qg" represents all 2-cuts in G" = G*[P — s]. By Lemma 9, 
there is a bijection between Eg*{s) and L{Qg"), and thus Qg" has an even 
number of leaf vertices. A set cr of new edges which pairwise connect all leaf 
vertices in a cactus is called a leaf-matching. Hence finding a complete {k, s)- 
feasible splitting in G* is to obtain a leaf-matching cr in Qg" such that adding 
the leaf-matching destroys all 2-cuts in the cactus Qg" ■ 

However, to ensure that the complete splitting corresponding to a leaf-mat- 
ching preserves the planarity of G*, we need to choose a leaf-matching a of Qg" 
carefully. An embedding y of a cactus in the plane is called standard if all leaf 
vertices are located on the outer face of y. In particular, for a cyclic order tt of leaf 
vertices, a standard embedding y of a cactus is called a standard ir-embedding 
if the leaf vertices appear in the order of tt along the outer face of y. Note that 
a standard 7r-embedding of a cactus is unique (unless we distinguish one edge 
from the other in a cycle of length two). We define a flipping operation in an 
embedding y of cactus Q — {V,£) as follows. Choose a cycle G in ^ and a vertex 
z in G. We see that removal of the two edges in G incident to z creates two 
connected components, say Q' and Q"] we assume that z G V{Q'). Let G[C,z] 
denote the subgraph Q' of Q. We say that the embedding y of ^ is flipped by 
(G, z) if we fold the subgraph Q[G, z] back into the other side of area surrounded 
by G while fixing the other part of the embedding y. An embedding obtained 
from a standard 7r-embedding of a cactus by a sequence of dipping operations is 
called a x' -embedding. 

Recall that neighbors in Eg*{s) appear in cyclic order = (wi, . . . ,Wr) 
in an embedding y^ of G* . We also use to represent the cyclic order of 
leaf vertices zi = (p{wi),Z 2 = ip{w 2 ) , . ■ . , Zr = g^{wr) in cactus Qg"- Then the 
standard 7 t(j, - embedding y^ of Qg" has the following property: 

each vertex z G V in Qg" can be replaced with the subgraph , , 

G*[lp~^{z)] without creating crossing edges in y. ^ ' 

Observe that a dipping operation preserves property (3), and hence any tt- 
embedding y of Qg" also satisfies the property (3). 

A pair (cr, y) of a leaf-matching a on leaf vertices in a cactus Q and a tt- 
embedding y of tj is called a tt- configuration. A rr-configuration (cr, y) of Q is 
called good if it satisfies the following conditions. 
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(a) all 2-cuts in cactus Q = (V,f) are destroyed by adding a (i.e., for any 2-cut 
X, a contains an edge {z, z') with z G X and z' G V — X), and 

(b) all edges in a can be drawn in 7r-embedding x oi G without creating crossing 
edges. 

Now the problem of computing a complete and planarity-preserving (3, s)- 
feasible splitting in G* is to find a good -configuration (cr, y) of Gg"- To show 
that such a good Tr^-configuration always exists in Gg" , h suffices to prove the 
next lemma (the proof is omitted) . 

Lemma 10. For a cactus G = (V,£) and a cyclic order tt of leaf vertices, as- 
sume that there is a standard ir-emhedding of G ■ Then there always exists a good 
TT- configuration (cr, x) of G, and such a configuration can he found in 0(|Vp) 
time. □ 

By this lemma, a complete and planarity-preserving (3, s)-feasible splitting 
in a graph G* which satisfies (2) can be computed in 0{nf) time. This and 
Lemma 8 establish the following theorem. 

Theorem 4. Given a planar multigraph G = (V, E) with a designated vertex 
s G V of even degree, and Ag(1^ — s) > 3, there exists a complete and planarity- 
preserving {3, s)- feasible splitting, and such a splitting can he found in 0(ji^) 
time. □ 

5 Augmenting Edge-Connectivity of Outerplanar Graphs 

Given a multigraph G = (V, E) and a positive integer k, the fc-edge-connectivity 
(resp., /c- vertex-connectivity) augmentation problem asks to find a minimum 
number of new edges to be added to G such that the augmented graph be- 
comes fc-edge-connected (resp., A:- vertex-connected). Watanabe and Nakamura 
[11] proved that the fc-edge-connectivity augmentation problem for general fc is 
polynomially solvable. In such applications as graph drawing (see [3]), a planar 
graph G is given, and we may want to augment its edge- (or vertex-) connectivity 
optimally while preserving its planarity. Kant and Boldlaender [5] proved that 
the planarity-preserving version of 2-vertex-connectivity augmentation problem 
is NP-hard. Kant [4] also showed that, if a given graph G is outerplanar, then 
the planarity-preserving versions of both the 2-edge-connectivity and 2-vertex- 
connectivity can be solved in linear time. For a planar graph G, let Xk{G) (resp., 
7 fc(G)) denote the minimum number of new edges to be added to G so that the 
resulting graph G' becomes fc-edge-connected (resp., so that the resulting graph 
G" becomes fc-edge-connected and remains planar). Clearly, Xk{G) < Xk{G) for 
any planar graph G and fc > 1. From the results in the preceding sections, we 
can show the next result. 

Theorem 5. Let G = {V, E) be an outerplanar graph. If k > 0 is an even 
integer or k = 3, then Xk{G) = ^k{G) and the planarity-preserving version of 
the k- edge- connectivity augmentation problem can he solved in 0(n^(m-|-nlogn)) 
time. 
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Proof: (Sketch) Based on Theorems 3 and 4, we can apply the approach of 
Cai and Sun [1] (also see [2]) for solving the fc-edge-connectivity augmentation 
problem by using the splitting algorithm. The detail is omitted. □ 

Furthermore, for every odd integer k > 5, there is an outerplanar graph G 
such that jk{G) < 7fe(G). Consider the graph Gg obtained from the graph G3 
in Example 1(c) by deleting s and the edges in Eq' (s). It is easy to see that 
7fc(G'3) = 2<3 = 7fc(G(i). 

Remark: Given an undirected outerplanar network N = (V,L,c) and a real 
k > 0, we consider the /c-edge-connectivity augmentation problem which asks 
how to augment N by increasing link weights and by adding new links so that 
the resulting network N' = {V, LUV , c') is /c-edge-connected and remains planar 
while minimizing ~ (e)) + J2eGL' c' (e), where c and c' are allowed 

to be nonnegative reals. It is not difficult to observe that this problem can be 
solved in 0(n^(m + nlogn)) time by the argument given so far in this paper. 
(It would be interesting to see whether the problem can be formulated as a 
linear programming or not; if the planarity is not necessarily preserved then the 
problem is written as a linear programming.) 
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Abstract. Given a complete undirected graph with non-negative costs 
on the edges, the 2-Edge Connected Subgraph Problem consists in finding 
the minimum cost spanning 2-edge connected subgraph (where multi- 
edges are allowed in the solution). A lower bound for the minimum cost 
2-edge connected subgraph is obtained by solving the linear programming 
relaxation for this problem, which coincides with the subtour relaxation 
of the traveling salesman problem when the costs satisfy the triangle 
inequality. 

The simplest fractional solutions to the subtour relaxation are the |- 
integral solutions in which every edge variable has a value which is a 
multiple of We show that the minimum cost of a 2-edge connected 
subgraph is at most four-thirds the cost of the minimum cost |-integral 
solution of the subtour relaxation. This supports the long-standing | 
Conjecture for the TSP, which states that there is a Hamilton cycle which 
is within | times the cost of the optimal subtour relaxation solution when 
the costs satisfy the triangle inequality. 



1 Introduction 

The 2-Edge Connected Subgraph Problem is a fundamental problem in Sur- 
vivable Network Design. This problem arises in the design of communication 
networks that are resilient to single-link failures and is an important special case 
in the design of survivable networks [11,12,14]. 

1.1 Formulation 

An integer programming formulation for the 2-Edge Connected Subgraph Prob- 
lem is as follows. Let AT„ = (V,E) be the complete graph of feasible links on 
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which the 2-Edge Connected Subgraph Problem is formulated. We denote an 
edge of this graph whose endpoints are i G V and j G V hy ij. For each vertex 
V G V, let 6{v) C E denote the set of edges incident to v. For each subset of 
vertices S' C P, let (5(5') C F denote the set of edges in the cut which has S as 
one of the shores, i.e. the set of edges having exactly one endpoint in S. Denote 
the edge variable for e G E hy Xg, which is 0,1, or 2 depending on whether e is 
absent, occurs singly or doubly in the 2-edge connected subgraph. For A C E, 
let x{A) denote the sum X^eeA denote the cost of edge e. We have the 

following integer programming formulation. 

minimize c • x 
subject to 

x((5(w)) >2 for all u G P, . s 

x{5\s)) > 2 for all S C P, ^ ^ 

Xg >0 for all e G E, 

Xg integral. 

The LP relaxation is obtained by dropping the integrality constraint in this 
formulation. This LP relaxation is almost the same as the subtour relaxation 
for the Traveling Salesman Problem (TSP). The Traveling Salesman Problem 
consists in finding the minimum cost Hamilton cycle in a graph (a Hamilton 
cycle is a cycle which goes through all the vertices) . The subtour relaxation for 
the TSP is as follows. 

minimize c • x 
subject to 

x{S{v)) = 2 for all v GV, (2) 

x^J^S*)) > 2 for all S' C P, 

Xg >0 for all e G E. 

The constraints of the subtour relaxation are called the degree constraints, the 
subtour elimination constraints, and the non-negativity constraints respectively. 

If one has the relationship Cy < Cik + Cjk for all distinct i,j,k G V, then 
c is said to satisfy the triangle inequality. An interesting known result is that 
if the costs satisfy the triangle inequality, then there is an optimal solution to 
(1) which is also feasible and hence optimal for (2). This follows from a result 
of Cunningham [11] (A more general result called the Parsimonious Property is 
shown by Goemans and Bertsimas in [7]). We can show that this equivalence 
holds even when the costs do not satisfy the triangle inequality. In the latter 
case, we replace the given graph by its metric completion, namely, for every 
edge ij such that is greater than the cost of the shortest path between i and 
j in the given graph, we reset the cost to that of this shortest path. The intent 
is that if this edge is chosen in the solution of (1), we may replace it by the 
shortest cost path connecting i and j. Since multiedges are allowed in the 2-edge 
connected graph this transformation is valid. Hence without loss of generality, 
we can assume that the costs satisfy the triangle inequality. 
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1.2 Our Result and its Significance 

Our main result is the following. 

Theorem 1. The minimum cost of a 2-edge connected subgraph is within | 
times the cost of the optimal half-integral subtour solution for the TSP. 

This result is a first step towards proving the following conjecture we offer. 

Conjecture 1. The minimum cost of a 2-edge connected subgraph is within | 
times the cost of the optimal subtour solution for the TSP. 

By our remarks in the end of Section 1.1, it would follow from Conjecture 1 
that the minimum cost of a 2-edge connected subgraph is also within | times 
the cost of an optimal solution to the linear programming relaxation (1). 

We formulated Conjecture 1 as an intermediate step in proving the following 
stronger “four-thirds conjecture” on the subtour relaxation for the TSP, which 
would directly imply Conjecture 1. 

Conjecture 2. If the costs satisfy the triangle inequality, then the minimum cost 
of a Hamilton cycle is within | times the cost of the optimal subtour solution 
for the TSP. 

Note that Theorem 1 and Conjecture 1 imply similar relations between the 
fractional optimum of the subtour relaxation and a minimum-cost 2-vertex con- 
nected subgraph when the costs obey the triangle inequality. In particular. The- 
orem 1 implies that when the costs satisfy the triangle inequality, the minimum 
cost 2-vertex connected spanning subgraph is within | times the cost of the 
optimal half-integral subtour solution for the TSP. This follows from the simple 
observation that from the minimum-cost 2-edge connected graph, we can short- 
cut “over” any cut vertices without increasing the cost by using the triangle 
inequality [5,11]. 

1.3 Related Work 

A heuristic for finding a low cost Hamilton cycle was developed by Christofides 
in 1976 [4]. An analysis of this heuristic shows that the ratio is no worse than | in 
both Conjecture 1 and Conjecture 2. This analysis was done by Wolsey in [16] and 
by Shmoys and Williamson in [15]. A modification of the Christofides heuristic 
to find a low cost 2- vertex connected subgraph when the costs obey the triangle 
inequality was done by Fredrickson and Ja Ja in [5]. The performance guarantee 
for this heuristic to find a 2- vertex connected subgraph is |. There has also 
been a spate of work on approximation algorithms for survivable network design 
problems generalizing the 2-edge connected subgraph problem [7,8,9,10,13,17]; 
however, the performance guarantee for the 2-edge connected subgraph problem 
from these methods is at best | when the costs obey the triangle inequality 
(shown in [5,7]) and at best 2 when they do not (shown in [9]). 

Both Conjecture 2 and Conjecture 1 have remained open since Christofides 
developed his heuristic. In this paper, we suggest a line of attack for proving 
Conjecture 1. 
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2 Motivation 

In this section we discuss two distinct motivations that led us to focus on half- 
integral extreme points and prove a version of Conjecture 1 for this special case. 
One follows from a particular strategy to prove Conjecture 1 and the other 
from examining subclasses of subtour extreme points that are sufficient to prove 
Conjectures 1 and 2. 



2.1 A Strategy for Proving Conjecture 1 

Let an arbitrary point x* of the subtour polytope for be given. Multiply this 
by I to obtain the vector |a;*. Denote the edge incidence vector for a given 2-edge 
connected subgraph H in by ■ Note that edge variables could be 0,1, or 2 
in this incidence vector. Suppose we could express |a;* as a convex combination 
of incidence vectors of 2-edge connected subgraphs Hi for i = 1, 2, . . . , fc. That 
is, suppose that 



= (3) 

where Ai > 0 for i = 1,2, ... ,k and 

k 

Ea, = i. 

Then, taking dot products on both sides of (3) with the cost vector c yields 

= (4) 

Since the right hand side of (4) is a weighted average of the numbers c • x^' ? it 
follows that there exists a j G {1, 2, . . . , A:} such that 

c • X^^ ^ |c • X*. (5) 

If we could establish (5) for any subtour point x* , then it would in particular 
be valid for the optimal subtour point, which would prove Conjecture 1. 

In an attempt at proving Conjecture 1, we aim at contradicting the idea of a 
minimal counterexample, that is, a subtour point x* having the fewest number of 
vertices n' such that (3) can not hold for any set of 2-edge connected subgraphs. 
First we have the following observation. 

Theorem 2. At least one of the minimal eounterexamples x* to (3) holding 
(for some set of 2- edge eonnected subgraphs) is an extreme point of the suhtour 
polytope. 



Proof. Suppose x* = '^itiix\ where each x^ is an extreme point which is not 
a minimal counterexample, and the /i/’s satisfy the usual constraints for a set 
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of convex multipliers. Thus, for each I, we can find a set of 2-edge connected 
subgraphs Hi such that 




I 



where the A*’s satisfy the usual constraints for a set of convex multipliers. Then 

( 6 ) 

Since we have that 

I i I 

Equation (6) shows that ^x* can be expressed as a convex combination of 2-edge 
connected subgraphs as well, from which this theorem follows. 

Thus we need to focus only on minimal counterexamples x* in Kn> which are 
extreme points. To carry out the proof, we wish to find a substantial tight cut 
5{H) for X*, i.e. an H C V such that 3 < \H\ < n' — 3 and 

x*{6{H))=2. 

We can then split x* into 2 smaller subtour solutions x^ and x^ in the following 
way. Take the vertices of V \ H in x* and contract them to a single vertex to 
obtain x^ . Likewise, take the vertices of H in x* and contract them to a single 
vertex to obtain x"^ . An example of this is shown in Figure 1. 

Since x^ and x^ are not counterexamples to our conjecture, we would be able 
to decompose and into combinations of 2-edge connected subgraphs, 
which we may then attempt to glue together to form a similar combination for 
|a;*, thereby showing that x* is not a counterexample (We show how this can be 
accomplished for the case of half-integral extreme points in Case 1 in the Proof 
of Theorem 6). 

What if there are no tight substantial cuts however? The following proposi- 
tion which is shown in [1] shows us what we need to do. 

Proposition 1. If x* is an extreme point of the subtour polytope and has no 
substantial tight cuts, then x* is a 1 /2-integer solution. 

This led us to focus on 1/2-integral solutions x*, and we were able to complete 
the proof for this special case. In the next section, we show our main result that 
if x* is a 1/2-integer subtour solution, then (3) can always be satisfied. 

2.2 The Important Extreme Points 

Consider any extreme point x*. We wish to express ^x* as a convex combination 
of 2-edge connected graphs for Conjecture 1 or a convex combination of Eulerian 
graphs for Conjecture 2. An important question is what features of x* make it 
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Fig. 1. An idea for splitting a minimal counterexample into two smaller in- 
stances. Note that H defines a substantial tight cut, i.e., both H and V\H have 
at least three vertices and x{6{H)) = 2. 



difficult to do this? In an effort to answer this question, we try to transform 
X* into another extreme point x* on a larger graph so that x* belongs to a 
subclass of the extreme points, but |lr* is at least as hard to express as a convex 
combination of 2-edge connected graphs (or Eulerian graphs) as |x* is. The idea 
then is that we only have to be able to express as a convex combination of 
2-edge connected graphs (or Eulerian graphs) for all extreme points x belonging 
to this particular subclass in order to prove Conjecture 1 (or Conjecture 2). If 
we have a subclass S of extreme points x such that being able to express 
as a convex combination of 2-edge connected graphs for all extreme points x 
belonging to this particular subclass is sufficient to prove Conjecture 1, then we 
say that S is sufficient to prove Conjecture 1. Likewise, a subclass S can be 
sufficient to prove Conjecture 2. 

We have found two different subclasses of extreme points which are sufficient 
to prove both Conjecture 1 and Conjecture 2. In some sense, the extreme points 
in such a subclass are the hardest extreme points to deal with when proving 
Conjecture 1 or Conjecture 2. One class, termed fundamental extreme points, 
can be found in [2]. 

Definition 1. A fundamental extreme point is an extreme point for the suhtour 
relaxation satisfying the following conditions. 

(i) The support graph is 3-regular, 

(ii) There is a Tedge incident to each vertex, 

(Hi) The fractional edges form disjoint cycles of length j. 
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A second class of such sufficient extreme points is described below. We will 
restrict our attention to showing that the subclass described below is sufficient to 
prove Conjecture 1, although showing that it is also sufficient to prove Conjecture 
2 requires only minor modifications in our arguments. 

Consider any extreme point x* . Pick the smallest integer k such that x* is a 
multiple of ^ for every edge e G E. Then form a 2/c-regular 2fc-edge connected 
multigraph Gk = (P, Ek) as follows. For every edge e = uv G E, put I edges 
between u and v, where I := kx*. Then showing that can be expressed 
as a convex combination of 2-edge connected graphs is equivalent to showing 
that |a:* can be so expressed. But suppose every vertex in Gk is replaced by 
a circle of 2k nodes, each node with one edge from Ek, and 2fc — 1 new edges 
linking this node to its two neighboring nodes in the circle, all in such a way 
that the resulting graph Gk = (V, Ek) is still 2fc-regular and 2/c-edge connected. 
Note that loosely speaking, we have Ek C Ek- We seek to then show that if 
we can express as a convex combination of 2-edge connected graphs, then 
we can do so for ^Ek as well. The graph Gk will turn out to corresponds to a 
subtour extreme point x* (in the same way that Gk corresponds to x*). It is 
more convenient to define this subtour extreme point x* than to define Gk- 

Let us now define x* . 

Definition 2. Expand each vertex in V into a circle of 2k nodes, with an edge 
of Ek leaving each such node, as described in the previous paragraph. Take the 
equivalent of an Eulerian tour through all the edges of Ek by alternately travers- 
ing these edges and jumping from one node to another node in the same circle 
until you have traversed all of the edges in Ek and have come back to the edge in 
Ek you started with. When you jump from node u to node v in the same circle 
in this Eulerian tour, define := For every edge e G Ek, we naturally 

define x* := For each circle Cy of nodes corresponding to the vertex v G V, 
we pick an arbitrary perfect matching My on the nodes in Cy, including in My 
only edges e which have not yet been used in the definition ofx*. We then define 
X* := 1 for all e G My. 

We have the following: 

Theorem 3. x* in Definition 2 is a subtour extreme point. 

Proof. The support graph of x* is 3-regular, with the fractional edges in x* 
forming a Hamilton cycle on the vertices V . Call the edges in S'*’s support graph 
Ek- 

We first show that x* is a feasible subtour point. If it were not, there would 
have to be a cut in the graph Gk = (V,Ek) of value less than 2. Clearly, such 
a cut C would have to go through some circle Cy of nodes since Gk is 2fc-edge 
connected. But the contribution of the edges from the circle Cy to any cut 
crossing it is at least 1 since the edges in the circle Cy each have a value greater 
than or equal to 1/2. Hence, the contribution from the non-circle edges in the 
cut C is less than 1. But this is not possible because when v is ripped out of x*, 
the minimum cut in the remaining solution is greater than or equal to 1. Hence, 
X* is a feasible subtour point. 




A New Bound for the 2-Edge Connected Subgraph Problem 



119 



We show that x* is an extreme point by showing that it can not be expressed 
as where x^ and are distinct subtour points. Suppose x* could 

be so expressed. Then the support graphs of x^ and x'^ would coincide with or 
be subgraphs of the support graph Ek of x* . Because of the structure of the 
support graph, setting the value of just one fractional edge determines the entire 
solution due to the degree constraints. Hence, all the edges e G Ek such that 
Xe = would have to say be smaller than ^ in x^ and larger than ^ in But, 
then a cut separating any circle of nodes Cy from the rest of the vertices in x^ 
would have a value less than 2, which contradicts x^ being a subtour point. 

We now have the following: 

Theorem 4. If |a;* can be expressed as a convex combination of 2-edge con- 
nected graphs spanning V , then |x* can be expressed as a convex combination 
of 2-edge connected graphs spanning V. 

Proof. Suppose |lr* can be expressed as a convex combination 

= ( 7 ) 

where the Hfs are 2-edge connected graphs spanning V. For each i, contract 
each circle of nodes Cy back to the vertex v G V in Hi. Call the resulting graph 
Hi. Since contraction preserves edge connectivity. Hi is a 2-edge connected graph 
spanning V . When one performs this contraction on x* , one gets x*. As a result, 
we obtain that 



fa;* = (8) 

which proves our theorem. 

We can now define the subclass of important extreme points. 

Definition 3. An important extreme point is an extreme point for the subtour 
relaxation satisfying the following conditions. 

(i) The support graph is 3-regular, 

(a) There is a Tedge incident to each vertex, 

(Hi) The fractional edges form a Hamilton cycle. 

We are now ready for the culminating theorem of this section. 

Theorem 5. The subclass of important extreme points is sufficient to prove 
Conjecture 1. 

Proof. If there is an extreme point x* such that |x* cannot be expressed as a 
convex combination of 2-edge connected graphs, then by Theorem 4, the impor- 
tant extreme point |a;* cannot be expressed as a convex combination of 2-edge 
connected graphs either. Hence, our theorem follows. 

The analogous theorem for the class of fundamental extreme points can be found 
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3 The Proof of Theorem 1 

Let X* be a 1/2-integer subtour solution on = (V,E). Denote the edges of 
the support graph of x* (the set of edges e G E such that x* > 0) by E{x*). 
Construct the multigraph G{x*) = (V,E{x*)), where E{x*) D E{x*) and differs 
from E{x*) only in that there are two copies in E{x*) of every edge e G E(x*) 
for which x* = 1. Note that the parsimonious property [7] implies that there are 
no edges e with a;e > 1 in the optimal fractional solution. 

Because of the constraints of the subtour relaxation, it follows that G(x*) 
is a 4-regular 4-edge connected multigraph. Similarly, corresponding to every 4- 
regular 4-edge connected multigraph is a 1/ 2-integer subtour solution, although 
this solution may not be an extreme point. 

Showing (3) for some choice of 2-edge connected subgraphs Hi for every 
1/2- integer subtour solution x* would prove Conjecture 1 whenever the optimal 
subtour solution was 1/2-integer, as was discussed in the last section. So, equiva- 
lently to showing (3) for some choice of 2-edge connected subgraphs Hi for every 
1/2- integer subtour solution x* , we could show 

|x^(^) = EiAiX^S (9) 

where this expression is a convex combination of some chosen set of 2-edge 
connected subgraphs Hi, for every 4-regular 4-edge connected multigraph G = 
(V, E(G)). These are equivalent because of the remarks in the previous paragraph 
and the observation that G{x*) behaves like 2x* . 

It turns out that (9) is very difficult to show directly, but the following 
slight strengthening of it makes the task easier. Consider any 4-regular 4-edge 
connected multigraph G = {V,E{G)) and any edge e G E{G). Then, we prove 
instead that 

= (10) 

where this expression is a convex combination of some chosen set of 2-edge 
connected subgraphs Hi. 

For technical reasons, we will prove (10) with the additional restriction that 
none of the Hi’s may use more than one copy of any edge in E(G). Note however 
that G may itself have multiedges so H may also have multiedges. In the latter 
case, we think of two parallel multiedges in H as being copies of two distinct 
multiedges in G. 

For any 4-regular 4-edge connected graph G and any edge e G E{G), we 
define P{G,e) to be the following statement. 

Statement 1. P{G,e) For some finite set of 2-edge connected subgraphs Hi, 
we have (10), where Ai > 0 for all i and Xi = 1, and none of the Hi’s may 
use more than one copy of any edge in E{G). 

As noted above. Statement 1 does not rule out the possibility of doubled 
edges in the Hi’s because there may be doubled edges in G. 
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We define a tight cut for a 4-edge connected graph G to be a cut which has 
exactly 4-edges in it. We define a non-trivial cut for such a graph to be a cut 
where both shores have at least 2 vertices each. We have the following lemma. 

Lemma 1. Let G = {V, E) be a 4-Tegular 4~edge eonnected graph which has no 
tight non-trivial cut which includes an edge e = uv G E. Let the other 3 (not 
necessarily distinct) neighbors of v be x,y, and z. Then either ux or yz is a loop 
or G' = G — V ux yz is 4~^sgular and 4~edge connected, and likewise for the 
other combinations. 

Proof. Let G = (V, E) and e = uv G E he given, where the neighbors of v are 
as stated. First, note that any cut in G containing all four edges incident on v 
has size at least 8, since the cut formed by moving v to the opposite side of the 
cut must have size at least 4 since G is 4-edge connected. 

Suppose neither ux or yz is a loop. Then clearly, G' is a 4-regular connected 
graph. Since it is 4-regular, every cut has an even number of edges in it. By our 
earlier observation, there can be no cuts S{H) in G' of cardinality zero. Suppose 
G' has a non-trivial cut S{H) with only 2 edges in it. Consider G = G -\- ux -\- yz 
with vertex v back in. The two non-trivial cuts 5{H U {u}) and 5{{V \ H) U {u}) 
can each have at most 3 more edges each (for a total of 5 edges each) since as 
observed earlier, these cuts could not have all 4 edges incident to v in them. But, 
G = G — ux — yz has only cuts with an even number of edges in them since it 
is 4-regular. Hence the cuts 6{H U {u}) and (5((C \ H) U {u}) in G have at most 
4 edges in them. One of these two cuts is a tight non-trivial cut which contains 
e, which yields the lemma. 

We are now ready for our main theorem. 

Theorem 6. Let x* be a minimum cost 1 /2-integer subtour solution. Then there 
exists a 2-edge connected subgraph H such that c ■ < |c • x*. 

Proof. As remarked in the discussion before this theorem, it is sufficient to prove 
P(G, e) for all 4-regular 4-edge connected multigraphs G and for all e G E{G). To 
prove this, we show that a minimal counterexample to P(G, e) can not happen. 

Let G = {V, E{G)) he a, 4-regular 4-edge connected multigraph and e G E{G) 
which has the minimum number of vertices such that P(G, e) does not hold. 
Since by inspection, we can verify that P(G, e) holds when G has 3 vertices, 
we can assume that \V\ > 3. We now consider the cases where G has a tight 
non-trivial cut which includes edge e and where G has no tight non-trivial cut 
which includes e. 

Case 1: G has a tight non-trivial cut which includes edge e. 

Choose such a tight non-trivial cut and denote the edges other than e in this 
cut by a, b, and c. As before, consider contracting one of the shores of this cut to 
a single vertex vi. Denote the edges incident to vi, which corresponded to e, a, 6, 
and c, by ei,ai,bi, and ci respectively. This resulting graph Gi = {V\,Ei) can 
be seen to be 4-regular and 4-edge connected. (To see this, suppose there was a 
cut of cardinality less than four in G\ and let Hi be the shore of this cut not 
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containing v\. Then the cut 5{Hi) in G shows that G is not 4-edge-connected, 
a contradiction.) Since (G, e) was a minimal counterexample to P{G,e), we 
have P{Gi,ei). By contracting the other shore, we can get a 4-regular 4-edge 
connected graph G 2 , and we know that P{G 2 ,e 2 ) also holds. 

By P{Gi,ei) we have 



( 11 ) 

and by P(G 2 ,C 2 ) we have 

(12) 

In (11), consider the edges incident to v\ in each of the Hi's. There are clearly 
at least 2 such edges for every H} . The values of edges ai,6i,ci, and ci in 
2 ^E(Gi)\{ei} 2 ^ 2 ^ 2 ^ Q respectively. This adds up to 2. Hence, since we 

are dealing with convex combinations, which are weighted averages, when the 
weights are taken into account, the Hi’s have on average 2 edges incident to vi 
each. But since every Hi has at least 2 such edges, it follows that every Hi has 
exactly 2 edges incident to v\ in it. 

For each 2-edge connected subgraph Hi which has edges ai and bi, denote 
the corresponding convex multiplier by A“^. Define and Aj'^ similarly. One 
can see that the only way for the variable values of edges ai,5i, and ci to end 
up all being | in is for the following to hold: 

E.Af = E.Ar = EiAf = i. (13) 



Similarly, we must have 



E./if = = = i (14) 

Call the three types of 2-edge connected graphs Hi as o6-graphs, ac-graphs, 
and 6c-graphs. Our strategy is to combine say each o6-graph Hi of Gi with an 
a6-graph of G 2 to form an a6-graph Hlj’ of G which is also 2-edge connected. 
So, we define 



Hf/ := {Hi - vi) + [H] -v2) + a + b, (15) 

where Hi and Hj are o6-graphs. Since Hi — vi and Hj — V 2 are both connected, 
it follows that Hlj’ is 2-edge connected. Similarly define and H^j. 

Now consider the following expression: 






(16) 



One can verify that this is in fact a convex combination. Any edge / in say 
Gi — v\ occurs in (16) with a weight of 

Ep I ■ (3 • E, ) + Ar • (3 • E, Mf) + Af • (3 • E, ))• (17) 
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In light of (14) we have that (17) evaluates to 

E ^* = 1 ( 18 ) 



We have a similar identity when / is in G 2 — V 2 and we also have that edges a, b, 
and c each occur in (16) with a weight of | as well. Therefore we have 



E,,, 3Af /if + E.,, 3Ar/tf + E.,, 3Af /if (19) 



which contradicts (G, e) being a minimal counterexample. 

Case 2 : G has no tight non-trivial cut which includes edge e. 

Denote the endpoints of e by m S 1^ and v G V, and denote the other 3 not 
necessarily distinct neighbors of u in G by x,y, z G V. Because e is in no tight 
non-trivial cut, we have that x ^ y ^ z. (If any two of the neighbors a;, y and z 
are the same, say x = y, then the cut (5({f,a;}) will be a tight non-trivial cut). 
Thus, without loss of generality, if any two neighbors are the same vertex, we 
can assume that they are u and z. Hence, u ^ x and u ^ y. 

Define the graph Gi = (Vi, ifi) by 



Gi = G — V + ux + yz, 



(20) 



and define ei = ux. We know by Lemma 1 that Gi is 4-regular and 4-edge 
connected. Since (G, e) is a minimal counterexample, we therefore know that 
P(Gi,ei) holds. Similarly, define the graph G 2 = (V 2 ,if 2 ) by 



G2 = G — V + uy + xz, 



(21) 



and define 62 = uy. As before, we know that P{G 2 , 62 ) holds as well. 

So, we can form the following convex combinations of 2-edge connected 
graphs: 






and 



Define H} by 



m = 



Likewise, define Hf by 



Hf = 






Hi — yz + yv + zv for yz G Hi , 
Hi + yv + XV for yz ^ Hi . 



Hf — xz + XV + zv for xz G Hf, 
Hf + yv + XV for xz ^ Hf. 



( 22 ) 



(23) 



(24) 



(25) 
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Consider the convex combination of 2-edge connected subgraphs 



+ (26) 

Every edge in / e E\S{v) occurs with a total weight of | in (26) since / occured 
with that weight in both (22) and (23). Since yz occurs with a total weight of | 
in (22) and xz occurs with a total weight of | in (23), one can verify that xv, yv, 
and zv each occur with a total weight of | in (26) as well. Therefore, we have 



h 






(27) 



which contradicts G, e being a minimal counterexample. 



4 Concluding Remarks 

An obvious open problem arising from our work is to extend our strategy and 
settle Conjecture 1. In another direction, it would be interesting to apply our 
ideas to design a |-approximation algorithm for the minimum cost 2-edge- and 
2-vertex-connected subgraph problems. 

Another interesting question is the tightness of the bound proven in Theo- 
rem 1. The examples we have been able to construct seem to demonstrate an 
asymptotic ratio of | between the cost of a minimum cost 2-edge connected sub- 
graph and that of an optimal half-integral subtour solution. Finding instances 
with a worse ratio or improving our bound in Theorem 1 are open problems. 
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Abstract. We give a ^-approximation algorithm for the following NP- 
hard problem: 

Given a simple undirected graph, find a 2-edge connected span- 
ning subgraph that has the minimum number of edges. 

The best previous approximation guarantee was | . If the well known TSP 
I conjecture holds, then there is a |-approximation algorithm. Thus our 
main result gets half-way to this target. 



1 Introduction 

Given a simple undirected graph, consider the problem of finding a 2-edge con- 
nected spanning subgraph that has the minimum number of edges. The problem 
is NP-hard, since the Hamiltonian cycle problem reduces to it. A number of 
recent papers have focused on approximation algorithms ^ for this and other 
related problems, [2]. We use the abbreviation 2-ECSS for 2-edge connected 
spanning subgraph. 

Here is an easy 2-approximation algorithm for the problem: 

Take an ear decomposition of the given graph (see Section 2 for defini- 
tions), and discard all 1-ears (ears that consist of one edge). Then the 
resulting graph is 2-edge connected and has at most 2n — 3 edges, while 
the optimal subgraph has > n edges, where n is the number of nodes. 

^ An a- approximation algorithm for a combinatorial optimization problem runs in 
polynomial time and delivers a solution whose value is always within the factor a 
of the optimum value. The quantity a is called the approximation guarantee of the 
algorithm. 

R. E. Bixby, E. A. Boyd, and R. Z. Rfos-Mercado (Eds.): IPCO VI 
LNCS 1412, pp. 126-136, 1998. © Springer-Verlag Berlin Heidelberg 1998 
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Khuller & Vishkin [8] were the first to improve on the approximation guarantee 
of 2. They gave a simple and elegant algorithm based on depth- first search that 
achieves an approximation guarantee of 1.5. In an extended abstract, Garg, San- 
tosh & Singla [6] claimed to have a 1.25-approximation algorithm for the prob- 
lem. No proof of this claim is available; on the other hand, there is no evidence 
indicating that achieving an approximation guarantee of 1.25 in polynomial time 
is impossible. 

We improve Khuller & Vishkin’s ^-approximation guarantee to If the 
well known TSP | conjecture holds, then there is a |-approximation algorithm, 
see Section 5. Thus our main result gets half-way to this target. 

Let G = (V,E) be the given simple undirected graph, and let n and m denote 
\V\ and \E\. Assume that G is 2-edge connected. 

Our method is based on a matching-theory result of Andras Frank, namely, 
there is a good characterization for the minimum number of even-length ears 
over all possible ear decompositions of a graph, and moreover, an ear decom- 
position achieving this minimum can be computed efficiently, [4]. Recall that 
the 2-approximation heuristic starts with an arbitrary ear decomposition of G. 
Instead, if we start with an ear decomposition that maximizes the number of 

1- ears, and if we discard all the 1-ears, then we will obtain the optimal solution. 
In fact, we start with an ear decomposition that maximizes the number of odd- 
length ears. Now, discarding all the 1-ears gives an approximation guarantee of 
1.5 (see Proposition 8 below). To do better, we repeatedly apply “ear-splicing” 
steps to the starting ear decomposition to obtain a final ear decomposition such 
that the number of odd-length ears is the same, and moreover, the internal nodes 
of distinct 3-ears are nonadjacent. We employ two lower bounds to show that 
discarding all the 1-ears from the final ear decomposition gives an approximation 
guarantee of The first lower bound is the “component lower bound” due to 
Garg et al [6, Lemma 4.1], see Proposition 4 below. The second lower bound 
comes from the minimum number of even-length ears in an ear decomposition 
of G, see Proposition 7 below. 

After developing some preliminaries in Sections 2 and 3, we present our 
heuristic in Section 4. Section 5 shows that the well known | conjecture for the 
metric TSP implies that there is a |-approximation algorithm for a minimum- 
size 2-EGSS, see Theorem 18. Almost all of the results in Section 5 are well 
known, but we include the details to make the paper self-contained. Section 6 
has two examples showing that our analysis of the heuristic is tight. Section 6 
also compares the two lower bounds with the optimal value. 

A Useful Assumption 

For our heuristic to work, it is essential that the given graph be 2-node con- 
nected. Hence, in Section 4 of the paper where our heuristic is presented, we 
will assume that the given graph G is 2-node connected. Otherwise, if G is not 

2- node connected, we compute the blocks (i.e., the maximal 2-node connected 
subgraphs) of G, and apply the algorithm separately to each block. We compute 
a 2-EGSS for each block, and output the union of the edge sets as the edge set of 
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a 2-ECSS of G. The resulting graph has no cut edges since the subgraph found 
for each block has no cut edge, and moreover, the approximation guarantee for 
G is at most the maximum of the approximation guarantees for the blocks. 



2 Preliminaries 

Except in Section 5, all graphs are simple, that is, there are no loops nor multi- 
edges. A closed path means a cycle, and an open path means that all the nodes 
are distinct. 

An ear decomposition of the graph G is a partition of the edge set into open 
or closed paths, Pq + Pi + . . . + Pk, such that Pq is the trivial path with one 
node, and each (1 < i < fc) is a path that has both end nodes in Vi-i = 
V{Po) U V{Pi) U . . . U V{Pi-i) but has no internal nodes in Vi-i. A (closed 
or open) ear means one of the (closed or open) paths Pq, Pi, . . . , Pk in the ear 
decomposition, and for a nonnegative integer £, an Gear means an ear that has £ 
edges. An Gear is called even if £ is an even number, otherwise, the Gear is called 
odd. (The ear Pq is always even.) An open ear decomposition Pq + Pi + . . . + Pk 
is one such that all the ears P 2 ,. . . ,Pk are open. 

Proposition 1 (Whitney [12]). 

(i) A graph is 2-edge connected if and only if it has an ear decomposition. 

(ii) A graph is 2-node connected if and only if it has an open ear decomposition. 

An odd ear decomposition is one such that every ear (except the trivial path 
Pq) has an odd number of edges. A graph is called factor- critical if for every node 
V G V(G), there is a perfect matching in G — v. The next result gives another 
characterization of factor-critical graphs. 

Theorem 2 (Lovasz [9], Theorem 5.5.1 in [10]). A graph is factor- critical 
if and only if it has an odd ear decomposition. 

It follows that a factor-critical graph is necessarily 2-edge connected. An open 
odd ear decomposition Pq -\- Pi Pk is an odd ear decomposition such that 
all the ears P 2 , ■ • ■ , Pfe are open. 

Theorem 3 (Lovasz & Plummer, Theorem 5.5.2 in [10]). A 2-node con- 
nected factor- critical graph has an open odd ear decomposition. 

Let e{G) denote the minimum number of edges in a 2-ECSS of G. For a graph 
H, let c{H) denote the number of (connected) components of H. Garg et al [6, 
Lemma 4.1] use the following lower bound on e(G). 

Proposition 4. Let G = (V, E) be a 2-edge connected graph, and let S be a 
nonempty set of nodes such that the deletion of S results in a graph with c = 
c{G — S) >2 components. Then e(G) > |E | -I- c — [S'! . 
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Proof. Focus on an arbitrary component D of G — S and note that it contributes 

> \V{D) \ + 1 edges to an optimal 2-ECSS, because every node in D contributes 

> 2 edges, and at least two of these edges have exactly one end node in D. 

Summing over all components of G — S' gives the result. □ 

For a set of nodes S C E of a graph G = {V,E), 5{S) denotes the set of 
edges that have one end node in S and one end node in E — S. For the singleton 
node set {u}, we use the notation 5{v). For a vector x : E^M, x{6{S)) denotes 

J2eeS(S) 

3 Prank’s Theorem and a New Lower Bound for e 

For a 2-edge connected graph G, let ip{G) (or tp) denote the minimum number 
of even ears of length > 2, over all possible ear decompositions. For example: 
(p{G) = 0 if G is a factor-critical graph (e.g., G is an odd clique A' 2 ^-i-i or an 
odd cycle G 2 i+i), ^{G) = 1 if G is an even clique K 21 or an even cycle G 2 l^ and 
<p{G) = £ — 1 if G is the complete bipartite graph K 2 / {(.> 2). The proof of the 
next result appears in [4], see Theorem 4.5 and Section 2 of [4]. 

Theorem 5 (A. Prank [4]). Let G = {V,E) be a 2-edge connected graph. An 
ear decomposition Pq Pi Pk of G having ip{G) even ears of length > 2 

can he computed in time 0(|P| • |i?|). 

Proposition 6. Let G he a 2-node connected graph. An open ear decomposition 
Pq -\- Pi Pk of G having ip{G) even ears of length > 2 can he computed in 

time 0{\V\ ■ |E|). 

Proof. Start with an ear decomposition having ip{G) even ears of length > 2 (the 
ears may be open or closed) . Subdivide one edge in each even ear of length > 2 by 
adding one new node and one new edge. The resulting ear decomposition is odd. 
Hence, the resulting graph G' is factor critical, and also, G' is 2-node connected 
since G is 2-node connected. Apply Theorem 3 to construct an open odd ear 
decomposition of G' . Finally, in the resulting ear decomposition, “undo” the 
(fi{G) edge subdivisions to obtain the desired ear decomposition Pq-\-Pi-\-. . .-\-Pk 
of G. □ 

Frank’s theorem gives the following lower bound on the minimum number of 
edges in a 2-ECSS. 

Proposition 7. Let G = (V, E) he a 2-edge connected graph. Then e{G) > 
\V\ + g,{G)-\. 

Proof. Consider an arbitrary 2-ECSS of G. If this 2-ECSS has an ear decom- 
position with fewer than ip{G) -\- 1 even ears, then we could add the edges of 
G not in the 2-ECSS as 1-ears to get an ear decomposition of G with fewer 
than if{G) -\- 1 even ears. Thus, every ear decomposition of the 2-ECSS has 

> (p{G) -\- 1 even ears. Let Pq + ^*1 + • ■ • + -Pfc be an ear decomposition of the 2- 

ECSS, where k > p>{G). It is easily seen that the number of edges in the 2-ECSS 
is fc -I- |P| — 1 > t{G) -\- \V\ — 1. The result follows. □ 
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The next result is not useful for our main result, but we include it for com- 
pleteness. 

Proposition 8. Let G = (V, E) he a 2-edge connected graph. Let G' = {V, E') he 
obtained by discarding all the 1- ears from an ear decomposition Pq-\- P i-\- . . Pk 
of G that has <p{G) even ears of length > 2. Then \E'\/e{G) < 1.5. 

Proof. Let t be the number of internal nodes in the odd ears of Pq + -Pi + ■ • ■ + • 

(Note that the node in Pq is not counted by t.) Then, the number of edges 
contributed to E' by the odd ears is < 3t/2, and the number of edges contributed 
to E' by the even ears is < ‘p-\-\V\ — t—l. By applying Proposition 7 (and the fact 
that e{G) > |y|) we get, \E'\/e{G) < {t/2-\-ip-\-\V\ — l)/max(|l/|, Lp-\- \V\ — 1) < 
(t/2\V\) + |P| - l)/((^ + |P| - 1) < 1.5. □ 

4 Approximating e via Frank’s Theorem 

For a graph H and an ear decomposition Pq + Pi + ■ • ■ + Pfe of P, we call an 
ear Pi of length > 2 pendant if none of the internal nodes of Pi is an end node 
of another ear Pj of length > 2. In other words, if we discard all the 1-ears from 
the ear decomposition, then one of the remaining ears is called pendant if all its 
internal nodes have degree 2 in the resulting graph. 

Let G = (V, E) be the given graph, and let (p = <p{G). Recall the assumption 
from Section 1 that G is 2-node connected. By an evenmin ear decomposition of 
G, we mean an ear decomposition that has (p(G) even ears of length > 2. Our 
method starts with an open evenmin ear decomposition Pq -\- P\ P^ of G, 
see Proposition 6, i.e., for 2 < i < k, every ear Pi has distinct end nodes, and the 
number of even ears is minimum possible. The method performs a sequence of 
“ear splicings” to obtain another (evenmin) ear decomposition Qo + Qi + - ■ - + Qk 
(the ears Qi may be either open or closed) such that the following holds: 

Property (a) 

(0) the number of even ears is the same in Pq -I- Pi -I- . . . -I- Pfe and in Qo + Qi + 

• • ■ + Qk, 

(1) every 3-ear Qi is a pendant ear, 

(2) for every pair of 3-ears Qi and Qj, there is no edge between an internal node 
of Qi and an internal node of Qj, and 

(3) every 3-ear Qi is open. 

Proposition 9. Let G = (V,E) be a 2-node connected graph with \V\ > 4. Let 
Pq -\- Pi Pk he an open evenmin ear decomposition of G. There is a 

linear-time algorithm that given Pq -\- Pi Pk , finds an ear decomposition 

Qo + Qi + ■ • ■ + Qk satisfying property (a ). 

Proof. The proof is by induction on the number of ears. The result clearly holds 
for A: = 1. Suppose that the result holds for (j — 1) ears Pq -I- Pi -I- . . . -I- P^-i. Let 
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Q'o + Q'i + ■ ■ ■ + Q'j-i be the corresponding ear decomposition that satisfies prop- 
erty (a). Consider the open ear Pj, j > 2. Let Pj be an Cear, vi,V 2 , ■ ■ ■ , V£, w^+i. 
Possibly, £ = 1. (So vi and vi+\ are the end nodes of Pj, and v\ yf ut-i-i.) 

Let T denote the set of internal nodes of the 3-ears oi Q'q + Q'i + . . . + Q'j-i- 
Suppose Pj is an ear of length £ > 2 with exactly one end node, say, v\ in T. 
Let Q[ = w\,vi,wz,wa be the 3-ear having v\ as an internal node. We take 
Qo = Qo,...,Qi-i = = Q'_,_i,...,(5y_2 = Q'j-i- Moreover, we take 

Qj-i to be the (^-|-2)-ear obtained by adding the last two edges of Q' to Pj, i.e., 
Qj-i = W 4 , ws, vi,V 2 , . ■ . ,Vi, V£+i, and we take Qj to be the l-ear consisting of 
the first edge w\V\ of Q[. Note that the parities of the lengths of the two spliced 
ears are preserved, that is, Qj-i is even (odd) if and only if Pj is even (odd), 
and both Qj and Q[ are odd. Hence, the number of even ears is the same in 
Pq -\- Pi Pj and in Qo + Qi + . . . + Qj- 

Now, suppose Pj has both end nodes vi and in T. If there is one 3-ear 
Q' that has both v\ and vi+i as internal nodes (so £ > 2), then we take Qj-i 
to be the {£ + 2)-ear obtained by adding the first edge and the last edge of Q' 
to Pj, and we take Qj to be the 1-ear consisting of the middle edge of 

Qi- Also, we take Qq — Qq, - - - , Qi—i — Qi—i,Qi — ^ ^ Q j —2 — Q j—i- 

Observe that the number of even ears is the same in Pq + + ■ • ■ + and in 

Qo + Qi + . . . + Qj. 

If there are two 3-ears Q' and Q), that contain the end nodes of Pj , then we 
take Qj -2 to be the (^-|-4)-ear obtained by adding the last two edges of both Q' 
and Q'f, to Pj, and we take Qj-i (similarly, Qj) to be the 1-ear consisting of the 
first edge of Q' (similarly, Q'fJ- (For ease of description, assume that if a 3-ear 
has exactly one end node v of Pj as an internal node, then v is the second node 
of the 3-ear.) Also, assuming i < h, we take Qo = Qq, - - - , Qi-i = Q)_i, Qi = 
Q^+i, ■■■! Qh -2 = Q'h_i,Qh-i = Q'h+iJ • ■ • ) Qi -3 = Q'j-i- Again, observe that 
the number of even ears is the same in Pq + P\ + - - - + Pj and in Qo -I- Qi -I- . . . -I- Qj . 

If the end nodes of Pj are disjoint from T, then the proof is easy (take 
Qj = Pj)- Also, if Pj is a 1-ear with exactly one end node in T, then the proof 
is easy (take Qj = Pj)- 

The proof ensures that in the final ear decomposition Qo + Qi + ■ . ■ + Qk, 
every 3-ear is pendant and open, and moreover, the internal nodes of distinct 3- 
ears are nonadjacent. We leave the detailed verification to the reader. Therefore, 
the ear decomposition Qo + Qi + ■ . ■ + Qk satisfies property (a). □ 

Remark 10- In the induction step, which applies for j > 2 (but not for j = 1), 
it is essential that the ear Pj is open, though Q' (and Q),) may be either open 
or closed. Our main result (Theorem 12) does not use part (3) of property (a). 

Our approximation algorithm for a minimum-size 2-ECSS computes the ear 
decomposition Qo + Qi + ■ . ■ + Qk satisfying property (a), starting from an open 
evenmin ear decomposition Pq + Pi + - - - + Pk- (Note that Qo + Qi + . ■ . + Qfc 
is an evenmin ear decomposition.) Then, the algorithm discards all the edges 
in 1-ears. Let the resulting graph be G' = {V,E')- G' is 2-edge connected by 
Proposition 1. 
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Let T denote the set of internal nodes of the 3-ears of Qo + Qi + • ■ - + Qk, and 
let t = \T\. (Note that the node in Qo is not counted by t.) Property (a) implies 
that in the subgraph of G induced by T, G[T], every (connected) component 
has exactly two nodes. Consider the approximation guarantee for G", i.e., the 
quantity \E'\/e{G). 

Lemma 11. e(G) > 3t/2. 

Proof. Apply Proposition 4 with S = V — T (so \S\ = n — t) and c = c(G — S) = 
t/2 to get e(G) >n—{n — t) + (t/2). □ 

Theorem 12. Given a 2-edge connected graph G = (V,E), the above algorithm 
finds a 2-ECSS G' = (V,E') such that |if'|/e(G) < The algorithm runs in 
time 0{\V\ ■ |A|). 

Proof. By the previous lemma and Proposition 7, 

e(G) > max(n -|- <p(G) — 1, 3t/2) . 

We claim that 

1^,1 ^ t , 5{n-^ip{G) - 1) 

i ■ 

To see this, note that the final ear decomposition Qo + Qi + . ■ . + Qk satisfies 
the following: (i) the number of edges contributed by the 3-ears is 3t/2; (ii) the 
number of edges contributed by the odd ears of length > 5 is < 5g/4, where q is 
the number of internal nodes in the odd ears of length > 5; and (iii) the number 
of edges contributed by the even ears of length > 2 is < (fi{G) {n — t — q — 1), 
since there are ip{G) such ears and they have a total of {n — t — q — 1) internal 
nodes. (The node in Qo is not an internal node of an ear of length >1.) 

The approximation guarantee follows since 

\E'\ ^ t/4 + 5(n + (p(G)-l)/4 
e{G) - e{G) 

^ t/4 -I- 5(n -I- (^(G) - l)/4 
“ max(n -|- (p{G) — 1, 3t/2) 

t 2 5(n -I- ip{G) — 1) 1 

“ 4 3t 4 n-\- <p{G) — 1 

_ n 
~ 12 ■ 

□ 



5 Relation to the TSP | Conjecture 

This section shows that the well known | conjecture for the metric TSP (due 
to Cunningham (1986) and others) implies that there is a ^-approximation al- 
gorithm for a minimum-size 2-ECSS, see Theorem 18. Almost all of the results 
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in this section are well known, except possibly Fact 13, see [1,3,5,7,11,13]. The 
details are included to make the paper self-contained. 

In the metric TSP (traveling salesman problem), we are given a complete 
graph G' = and edge costs c' that satisfy the triangle inequality < 
c'^u + Vw, tc, u G V). The goal is to compute c^gp, the minimum cost of a 
Hamiltonian cycle. 

Recall our 2-ECSS problem: Given a simple graph G = {V, E), compute e(G), 
the minimum size of a 2-edge connected spanning subgraph. Here is the multiedge 
(or uncapacitated) version of our problem. Given G = (V, E) as above, compute 
fJ,{G), the minimum size (counting multiplicities) of a 2-edge connected spanning 
submultigraph E[ = {V,F), where F is a multiset such that e G F e G E. 
(To give an analogy, if we take s{G) to correspond to the /-factor problem, then 
/r(G) corresponds to the /-matching problem.) 

Fact 13. If G is a 2-edge connected graph, then /i(G) = s{G). 

Proof. Let H = {V, F) give the optimal solution for fi{G). If H uses two copies 
of an edge vw, then we can replace one of the copies by some other edge of G 
in the cut given hy H — {vw, vw}. In other words, if S is the node set of one of 
the two components of H — {vw, uw}, then we replace one copy of vw by some 
edge from 6 a{S) — {uw}. □ 



Remark I 4 . The above is a lucky fact. It fails to generalize, both for minimum- 
cost (rather than minimum-size) 2-EGSS, and for minimum-size fc-EGSS, fc > 3. 

Given an n-node graph G = (V,E) together with edge costs c (possibly c 
assigns unit costs), define its metric completion G',c' to be the complete graph 
Kn = G' with {y v,w G V) equal to the minimum-cost of a v-w path in G, c. 

Fact 15. Let G be a 2-edge connected graph, and let c assign unit costs to the 
edges. The minimum cost of the TSP on the metric completion of G, c, satisfies 
4sp>m(G) = £(G). 

Proof. Let T be an optimal solution to the TSP. We replace each edge vw G 
E{T) — E{G) by the edges of a minimum-cost v-w path in G, c. The resulting 
multigraph H is obviously 2-edge connected, and has c'rpgp = c{H) > /i(G). □ 

Here is the subtour formulation of the TSP on G' ,d , where G' = AT„. This 
gives an integer programming formulation, using the subtour elimination con- 
straints. There is one variable Xe for each edge e in GL 

c'tsp = minimize c' ■ x 

subject to x((5(u)) =2, Vu G P 

xls{s)) >2, ys CV, S 

X > 0 , 

G ^ . 



X 
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The subtour LP (linear program) is obtained by removing the integrality con- 
straints, i.e., the a;- variables are nonnegative reals rather than nonnegative in- 
tegers. Let zsT denote the optimal value of the subtour LP. Note that zst is 
computable in polynomial time, e.g., via the Ellipsoid method. In practice, zst 
may be computed via the Held-Karp heuristic, which typically runs fast. 

3 

Theorem 16 (Wolsey [13]). If d is a metric, then cfgp < ^^ST- 

TSP I Conjecture. If d is a metric, then dpsp ^ tZst- 

o 

To derive the lower bound zst < s{G), we need a result of Goemans & 
Bertsimas on the subtour LP, [7, Theorem 1]. In fact, a special case of this result 
that appeared earlier in [11, Theorem 8] suffices for us. 

Proposition 17 (Parsimonious property [7]). Consider the TSP on G' = 

(V,E'),d, where G' = K\y\. Assume that the edge costs d form a metric, i.e., 
d satisfies the triangle inequality. Then the optimal value of the subtour LP 
remains the same even if the constraints |a;((5(u)) = 2,Vu € V} are omitted. 

Note that this result does not apply to the subtour integer program given 
above. 

Let Z 2 CUT denote the optimal value of the LP obtained from the subtour LP 
by removing the constraints x{6{v)) = 2 for all nodes v G V. The above result 
states that if d is a metric, then zst = z^cut- Moreover, for a 2-edge connected 
graph G and unit edge costs c = 1, we have Z 2 CUT < m(G') = ^{G), since ^(G) is 
the optimal value of the integer program whose LP relaxation has optimal value 
Z 2 CUT- (Here, Z 2 CUT is the optimal value of the LP on the metric completion of 
G, c.) Then, by the parsimonious property, we have zst = Z2CUT < £(G). The 
main result in this section follows. 

Theorem 18. Suppose that the TSP | conjecture holds. Then 

4 

Zst Si s(G) < dpsp If 3^'^^ ’ 

A ^-approximation of the minimum-size 2-ECSS is obtained by computing 
^zsT on the metric completion of G,c, where c = 1. 



The Minimum- Cost 2-ECSS Problem 

Consider the weighted version of the problem, where each edge e has a nonnega- 
tive cost Ce and the goal is to find a 2-ECSS {V, E') of the given graph G = {V, E) 
such that the cost c{E') = minimum. Khuller & Vishkin [8] pointed 

out that a 2-approximation guarantee can be obtained via the weighted matroid 
intersection algorithm. When the edge costs satisfy the triangle inequality (i.e., 
when c is a metric), Frederickson and Ja’Ja’ [5] gave a 1.5- approximation algo- 
rithm, and this is still the best approximation guarantee known. In fact, they 
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proved that the TSP tour found by the Christofides heuristic achieves an approx- 
imation guarantee of 1.5. Simpler proofs of this result based on Theorem 16 were 
found later by Cunningham (see [11, Theorem 8]) and by Goemans & Bertsimas 
[7, Theorem 4] . 

Consider the minimum-cost 2-ECSS problem on a 2-edge connected graph 
G = (V, E) with nonnegative edge costs c. Let the minimum cost of a simple 2- 
ECSS and of a multiedge 2-ECSS be denoted by Cg and c^, respectively. Clearly, 
Ce> c^. Even for the case of arbitrary nonnegative costs c, we know of no exam- 

c 7 cl 

pie where — — > - . There is an example G, c with — — > - . Take two copies of 
ZST 6 ZST 6 

iCa, call them Gi,G2, and add three disjoint length-2 paths Pi,P 2 ,P 3 between 
Gi and C 2 such that each node of Gi U C 2 has degree 3 in the resulting graph G. 
In other words, G is obtained from the triangular prism Cq by subdividing once 
each of the 3 “matching edges”. Assign a cost of 2 to each edge in Gi U G2, and 
assign a cost of 1 to the remaining edges. Then = 14, as can be seen by 

taking 2 edges from each of Gi, G2, and all 6 edges of Pi U P2 U P3. Moreover, 
ZST < 12, as can be seen by taking a;e = 1/2 for each of the 6 edges e in Gi U G2, 
and taking Xe = 1 for the remaining 6 edges e in Pi U P2 U P3 . 



6 Conclusions 

Our analysis of the heuristic is (asymptotically) tight. We give two example 
graphs. Each is an n-node Hamiltonian graph G = (V,P), where the heuristic 
(in the worst case) finds a 2-ECSS G' = {V,E') with 17n/12 — G(l) edges. 
The first example graph, G, is constructed by “joining” many copies of the 
following graph PI: H consists of a 5-edge path uq, ui, ^2,^3, U 4 , M5, and 4 disjoint 
edges viWi,V 2 W 2 ,V 3 W 3 ,V 4 W 4 . We take q copies of El and identify the node uq 
in all copies, and identify the node U5 in all copies. Then we add all possible 
edges UiVj, and all possible edges UiWj, i.e., we add the edge set of a complete 
bipartite graph on all the M-nodes and all the u-nodes, and we add the edge 
set of another complete bipartite graph on all the rt-nodes and all the w-nodes. 
Finally, we add 3 more nodes wj, U3 and 5 more edges to obtain a 5-edge cycle 
Mo, u[, u' 2 , M3, Ms, mq. Clearly, e(G) = n = 12q + 5. If the heuristic starts with the 
closed 5-ear mo,m/,M2,M3,ms,mo, and then finds the 5-ears mo,mi,M2,M3,M4,ms in all 
the copies of H, and finally finds the 3-ears uoVjWjU^ (1 < j < 4) in all the 
copies of P[, then we have jP'j = 17q + 5. 

Here is the second example graph, G = (V) E). The number of nodes is n = 
3 X 5^, and V = {0, 1, 2, ..., 3 x 5'^— 1}. The “first node” 0 will also be denoted 3 x 
5'^. The edge set E consists of (the edge set of) a Hamiltonian cycle together with 
(the edge sets of) “shortcut cycles” of lengths n/3,n/(3 x 5),n/(3 x 5^), ... ,5. 
In detail, E = | VO < i < <7 — 1}U {(3 x 5^ x i)(3 x x (i-|- 1)) | VO < j < 

<7-1,0 < i < 5«-^-l}.Note that jPj = 3x5«-b5«-b5«-i-b...-b5 = (17x5«-5)/4. 
In the worst case, the heuristic initially finds 5-ears, and finally finds 3-ears, and 
so the 2-ECSS (V, E') found by the heuristic has all the edges of G. Hence, we 
have |P'|/£(G) = |P|^ = 17/12 - 1/(12 x 5«"i). 
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How do the lower bounds in Proposition 4 (call it Lc) and in Proposition 7 
(call it L^) compare with e? Let n denote the number of nodes in the graph. 
There is a 2-node connected graph such that ejL^p > 1.5 — 0(l)/n, i.e., the upper 
bound of Proposition 8 is tight. There is another 2-edge connected (but not 2- 
node connected) graph such that ejLc > 1.5 — 6>(l)/n and ejL^ > 1.5 — 6>(l)/n. 
Among 2-node connected graphs, we have a graph with ejLc > 4/3 — 0{\)/n, 
but we do not know whether there exist graphs that give higher ratios. There is 
a 2-node connected graph such that ej max(Lc, L^) > 5/4 — 0{l)/n, but we do 
not know whether there exist graphs that give higher ratios. 
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Abstract. The Multicut problem is defined as follows: given a graph 
G and a collection of pairs of distinct vertices of G, find a small- 

est set of edges of G whose removal disconnects each Si from the corre- 
sponding ti. Our main result is a polynomial-time approximation scheme 
for Multicut in unweighted graphs with bounded degree and bounded 
tree-width: for any e > 0, we presented a polynomial-time algorithm 
with performance ratio at most 1 -|- e. In the particular case when the 
input is a bounded-degree tree, we have a linear-time implementation of 
the algorithm. We also provided some hardness results. We proved that 
Multicut is still NP-hard for binary trees and that, unless P = NP, 
no polynomial-time approximation scheme exists if we drop any of the 
the three conditions: unweighted, bounded- degree, bounded-tree- width. 
Some of these results extend to the vertex version of Multicut. 



1 Introduction 

Multicommodity Flow problems have been intensely studied for decades [7,11,9], 
[13,15,17] because of their practical applications and also of the appealing hard- 
ness of several of their versions. The fractional version of a Multicut problem 
is the dual of a Multicommodity Flow problem and, therefore. Multicut is of 
similar interest [3,9,10,13,20]. 
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The Weighted Multicut is the following problem: given an undirected 
graph G, a weight function w on the edges of G, and a collection of k pairs of 
distinct vertices of G, find a minimum weight set of edges of G whose 

removal disconnects each Si from the corresponding ti . 

The particular case in which fc = 1 is characterized by the famous Max- 
Flow Min- Cut Theorem [6], and is solvable in strongly polynomial time [4]. For 
fc = 2, a variant of the Max-Flow Min-Cut Theorem holds [11,12] and Multicut 
is solvable in polynomial time. For fc > 3, the problem is NP-hard [3]. 

Since many variants of the Weighted Multicut are known to be NP-hard, 
we search for efficient approximation algorithms. The performance ratio of an 
approximation algorithm A for a minimization problem is the supremum, over 
all possible instances /, of the ratio between the weight of the output of A when 
running on I and the weight of an optimal solution for I. We say A is an a- 
approximation algorithm if its performance ratio is at most a. The smaller the 
performance ratio, the better. 

The best known performance ratio for Weighted Multicut in general graphs is 
0(logA:) [10]. Important research has been done for improving the performance 
ratio when the input graph G belongs to special classes of graphs. For planar 
graphs, Tardos and Vazirani [20], see also [13], give an approximate Max-Flow 
Min-Cut theorem and an algorithm with a constant performance ratio. 

The case when the input graph is restricted to a tree has been studied in 
[9]. Unweighted Multicut problem (in which w{e) = 1 for all edges e of G) 
restricted to stars (trees of height one) is equivalent (including performance ratio) 
to Minimum Vertex Cover, by Proposition 1 in [9]. It follows that Unweighted 
Multicut restricted to stars is NP-hard and Max SNP-hard. In fact, getting a 
performance ratio better than two seems very hard, since getting a performance 
ratio better than two for Minimum Vertex Cover remains a challenging open 
problem [16]. Garg, Vazirani and Yannakakis give an algorithm in [9] with a 
performance ratio of two for the Weighted Multicut problem in trees. Note that 
the integral unweighted Multicommodity Flow problem in trees is solvable in 
polynomial time [9] . 

We find useful two variations of the Multicut problem. The Vertex Mul- 
ticut problem is: given an undirected graph G and a collection of k pairs of 
distinct nonadjacent vertices (si,ti) of G called terminals, find a minimum set 
of nonterminal vertices whose removal disconnects each si from the correspond- 
ing ti - The Unrestricted Vertex Multicut problem is: given an undirected 
graph G and a collection of k pairs of vertices (sj, tf) of G called terminals, find 
a minimum set of vertices whose removal disconnects each Si from the corre- 
sponding ti. (Note that in this variation, terminals might be removed.) Observe 
that Vertex Multicut is at least as hard as Unrestricted Vertex Multicut. From 
an instance of Unrestricted Vertex Multicut we can obtain an instance of Vertex 
Multicut by adding, for each Si, a new vertex s' adjacent only to Si, and, for 
each ti, a new vertex t[ adjacent only to U. Each pair (si,ti) is substituted by 
the new pair {s'i,t'i). Solving Vertex Multicut in this instance is equivalent to 
solving Unrestricted Vertex Multicut in the original instance. 
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Both Vertex Multicut and Unrestricted Vertex Multicut might be of interest 
on their own. Garg, Vazirani and Yannakakis considered the weighted version of 
Vertex Multicut and proved that their algorithm in [10] achieves a performance 
ratio of 0(log k) for the weighted version of Vertex Multicut in general graphs. 

From now on, we refer to Multicut as Edge Multicut, to avoid confusion. 
Let us mention some results we obtained for Vertex Multicut and Unrestricted 
Vertex Multicut. We have a proof that Vertex Multicut is NP-hard in bounded- 
degree trees. Unrestricted Vertex Multicut is easier: it is polynomially solvable 
in trees, but it becomes NP-hard in bounded-degree series-parallel graphs. 

The tree-width notion (first introduced by Robertson and Seymour [19]) 
seems to often capture a property of the input graph which makes hard problems 
easy. Various NP-hard problems, like Clique or Coloring, have a polynomial-time 
algorithm (linear time in fact) if the input graph has bounded tree- width (see 
for example [2]). We will present the formal definition of tree- width in Section 2. 

Bounded tree-width can also be used to obtain good approximation algo- 
rithms for those problems that remain NP-hard even if restricted to graphs of 
bounded tree-width. In our case. Unrestricted Vertex Multicut is NP-hard in 
graphs of tree-width at most two, since this class of graphs coincides with the 
series-parallel graphs (see for example [21]). We give a straightforward PTAS 
for Unrestricted Vertex Multicut in graphs of bounded tree-width. 

We present an approximation-ratio preserving reduction from Edge Multi- 
cut to Unrestricted Vertex Multicut. If the Edge Multicut instance graph has 
bounded degree and bounded tree-width, the Unrestricted Vertex Multicut in- 
stance obtained by the reduction has bounded tree-width. Combining the re- 
duction with the PTAS for Unrestricted Vertex Multicut in graphs of bounded 
tree-width, we obtain a PTAS for Unweighted Edge Multicut in graphs with 
bounded degree and bounded tree- width. This is the main result of the paper. 
Note that, according to [8, page 140, Theorem 6.8], a FPTAS cannot exist for 
this problem, unless P=NP. 

We also have a linear-time implementation of our PTAS for Edge Multicut 
in bounded-degree trees. The running time of our implementation is 0{(n + 
A:) [l/e]d‘^ri/£l+2)^ where n is the number of vertices of the tree, k is the number 
of (si,ti) pairs, d is the maximum degree of the tree and 1 -I- e is the desired 
approximation ratio of the algorithm. The size of the input is 0{n + k). 

We show that Edge Multicut is still NP-hard for binary (degree bounded by 
three) trees. Thus, on the class of graphs of bounded degree and bounded tree- 
width, which contains binary trees. Edge Multicut is easier (there is a PTAS) 
than on general graphs, yet still NP-hard. Identifying this class is the main 
theoretical result of this paper. 

Hardness results indicate why we cannot eliminate any of the three res- 
trictions — unweighted, bounded degree and bounded tree-width — on the input 
graph and still obtain a PTAS. It is known [1] that for a Max SNP-hard problem, 
unless P=NP, no PTAS exists. We have already seen that Unweighted Edge Mul- 
ticut is Max SNP-hard in stars [9], so letting the input graph have unbounded 
degree makes the problem harder. We show that Weighted Edge Multicut is Max 
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SNP-hard in binary trees, therefore letting the input graph be weighted makes 
the problem harder. Finally, we show that Unweighted Edge Multicut is Max 
SNP-hard if the input graphs are walls. Walls, to be formally defined in Sec- 
tion 4, have degree at most three and there are walls with tree-width as large as 
we wish. We conclude that letting the input graph have unbounded tree-width 
makes the problem significantly harder. 

In Section 2 we present the polynomial-time algorithm for Unrestricted Ver- 
tex Multicut in trees and the polynomial-time approximation scheme for Unre- 
stricted Vertex Multicut in bounded-tree- width graphs. In Section 3, we show the 
approximation-preserving reduction from Edge Multicut to Unrestricted Vertex 
Multicut. Finally, in Section 4 we present our hardness results. 



2 Algorithms for Unrestricted Vertex Multicut 

In this section we concentrate on Unrestricted Vertex Multicut. We present a 
polynomial-time algorithm for trees and a PTAS for graphs with bounded tree- 
width. Let us start defining tree-width. 

Let G be a graph and 0 be a pair (T, which consists of a tree 

T and a multiset whose elements X^, indexed by the vertices of T, are subsets of 
V{G). For a vertex v of G, we denote by the subgraph of T induced by those 
vertices w of T for which X^, contains v. Then 0 is called a tree decomposition 
of G if it satisfies the two conditions below: 

(1) For every edge e = xy oi G, there is a vertex w of T such that {x, y} C 

Xw 7 

(2) For every vertex v of G, the subgraph of T is a tree. 

The width of 0 is the maximum, over all vertices w of T, of |Vu,| — 1, and 
the tree-width of G, denoted by tw{G), is the minimum of the widths of all tree 
decompositions of G. 

Consider an instance of Unrestricted Vertex Multicut, that is, a graph G = 
(V,E) and a set C of pairs (si,C) of vertices of G. We say a pair (si,ti) in C is 
disconnected by a set A C U if Si is disconnected from U in the subgraph of G 
induced by U — S'. A set S is a solution for G if S disconnects all pairs (si,C) 
in C. If S has minimum size (i.e., minimum number of vertices), then S is an 
optimal solution for G. 

Now, let us describe the polynomial-time algorithm for trees. The input of 
the algorithm is a tree T and a set C of pairs {si,ti) of vertices of T. 

Consider the tree T rooted at an arbitrary vertex and consider also an ar- 
bitrary ordering of the children of each vertex (so that we can talk about pos- 
torder). 

Algorithm: 

Input: a tree T. 

Start with S = 0. 

Call a pair {si,ti) in C active if it is not disconnected by S. 

Traverse the tree in postorder. 
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When visiting vertex v, if v is the least common ancestor of some active 
pair in C, then insert v into S and mark 

Output S. 

Clearly the following invariant holds: all non-active pairs in C are discon- 
nected by S. A pair in C that becomes non-active does not go back to active 
since we never remove vertices from S. At the end of the algorithm, no pair in 
C is active, meaning that S' is a solution for the problem. For the minimality 
of S, note that the paths joining Si to U in T for all marked pairs (si,ti) form 
a pairwise disjoint collection of paths. Any solution should contain at least one 
vertex in each of these paths. But there are |S| marked paths, meaning that 
any solution has at least |S| vertices. This implies that |S| is a minimum-size 
solution. Besides, it is not hard to see that the algorithm can be implemented 
in polynomial time. 



2.1 Bounded- Tree-Width Graphs 

Next we present a PTAS for Unrestricted Vertex Multicut in graphs with bounded 
tree-width. A PTAS consists of, for each e > 0, a polynomial-time algorithm for 
the problem with a performance ratio of at most 1 -I- e. Let us describe such an 
algorithm. 

The input of our algorithm is a graph G = (V,E), a tree decomposition 
0 = (T, {Xu])wev(T)) of G, and a set C of pairs of vertices of G. 

Given a subgraph G' of G, denote by C(G') the set of pairs in C whose two 
vertices are in G', andby G\G' the subgraph of G induced by U(G)\U(G'). For 
the description of the algorithm, all the instances we mention are on a subgraph 
G' of G and the set of pairs to be disconnected is C(G'). So we will drop C(G') 
of the notation and refer to an instance only by the graph Gb Denote by opt(G') 
the size (i.e., the number of vertices) of an optimal solution for G' . 

Root the tree T (of the given tree decomposition) at a vertex r and consider 
an arbitrary ordering of the children of each vertex of T . For a vertex u oiT , let 
T{u) be the subtree of T rooted at u. Let G{u) be the subgraph of G induced 
by the union of all w G U(T(u)). Let t = \(tw{G) + l)/e]. 

Here is a general description of the algorithm: label the vertices of T in 
postorder. Find the lowest labeled vertex u such that an optimal solution for 
G{u) has at least t vertices. If there is no such vertex, let u be the root. Find 
an approximate solution Su for G{u) such that |S'„| < (1 -I- e)opt{G{u)) and 
Xu C Su- If u is the root of T, then output Su- Otherwise, let G' = G\G{u) and 
let 6>' = (T', (X!^)u,ev{T')) be the tree decomposition of G' where T' = T\T{u) 
and X'^ = Xu, \ V{G{u)), for all w G V{T')- Recursively get a solution S' for 
G'- Output S = S' \JSu- 

Next we present a detailed description of the algorithm. It works in iterations. 
Iteration i starts with a subgraph G'~^ of G, a tree decomposition 0®“^ = 

G*“^ with rooted at r, and a set S'~^ of vertices 
of G. Initially, G° = G, 0° = 0, S'® = 0 and i = \- The algorithm halts when 
G'~^ = 0. When G'~^ is nonempty, the algorithm starts calling a procedure 
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Get {u, A), which returns a vertex u of and a solution A for such 

that | 2 l| < (1 + e) opt{G''~^ (u)) and C A. Then the algorithm starts a new 
iteration with G* = G*“^ \ G®“^(w), G* = (T*, where T* = \ 

T*-i(u) and X; = \ V{G^-^{u)), for all w G v[t^), and S'* = S*"i U 

The formal description of the algorithm appears in Figure 2.1. 



Algorithm: 

G°^G; 

0° ^ 0; 

S° ^ 0; 
i ^ 1; 

while G*“^ 7 ^ 0 do 

Get (Mi, A*); /* |A*| < (l + e)opt(G*-i(Mi)) and a;-! C A* */ 

G* ^ G*-i \G*-i(Mi); 

yi ^ yi-l 

Ai ^ A^-i \ V{G^-\ui)), for each w G F(T*); 

S* ^ S*-i U Ah 
i ^ i + 1; 
endwhile; 

/ ^ - 1 ; 
output S^. 



Fig. 1. The algorithm for Unrestricted Vertex Multicut in bounded-tree-width 
graphs. 



We will postpone the description of Get {u, A) and, for now, assume that it 
works correctly and in polynomial time. The next lemma states a property of 
tree decompositions that we will use later. 

Lemma 1. Gonsider a graph G and a tree decomposition 0 = (T, {X^)^^viT)) 
of G. Let u be a vertex ofT, x he a vertex of G{u) and y he a vertex of G\G{u). 
Then any path in G from x to y contains a vertex of A„ . 

Next we prove that the output of the algorithm is in fact a solution. 

Lemma 2. is a solution for G. 

Proof. Let (s, t) be a pair in C and P be a path in G from s to t. We need to 
show that there is a vertex of P in . Note that the vertex sets U(G*’ 
define a partition of U(G). 

Let i be such that s is in G*“^(wi). If all vertices of P lie in G*“^(wi) then, 
in particular, both s and t are in G^~^{ui), which means (s,t) G C{G^~^{ui)). 
Since S^ contains a solution for G^~^{ui), must contain a vertex of P. 

If not all vertices of P lie in G*“^(Mi), let y be the first vertex of P that does 
not lie in G^~^{ui). If y is in G*“^ \G*“^(wi) then, by Lemma 1, there is a vertex 
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of in the segment of P from s to y. Since C , there is a vertex of 

P in . If y is not in \ then y is not in G®“^. This means y is in 

for some j < i. Moreover, s is in \ G^~^(uj) (because this is a 

supergraph of G*“^). Again by Lemma 1, there is a vertex of XP~^ C in P, 
concluding the proof of the lemma. ■ 

The next lemma proves that the performance ratio of the algorithm is at 
most 1 + e. 

Lemma 3. 15^1 < (1 + e)opt{G). 

Proof. We have that 



/ / 

|^^| = £|A*| < ^{l + e)opt{G^-\u.)) 

i=l i=l 

f 

= (1 + e) ^ opt(G*“^(Mj)) < (l + e)opt(G), 

because the subgraphs G*“^(ui) are vertex disjoint. ■ 

Now we proceed with the description of a straightforward polynomial-time 
implementation of Get (u,A). 

Search the vertices of the tree in postorder. Stop if the vertex u being 
visited is either the root or is such that opt{G'^~^ (u)) > t. Let us show how we 
check whether opt{G^~^ (u)) > t in polynomial time. 

If we are searching vertex u, it is because all children of u have been searched 
and have an optimal solution with less than t vertices. Compute an optimal 
solution for each child v of u. This can be done in time by brute force: 

check all subsets of G{v) of size at most t. The time is polynomial, since t = 
[(tw(G) -I- l)/e] is fixed. Let s be the sum of the sizes of the solutions for the 
children of u. 

Let us show that the optimum of G*“^(u) is at most s + tw{G) + 1. We do 
this by presenting a solution B for G*“^(m) of size at most s + tw{G) + 1. The 
set B is the union of A„ and an optimal solution for G*“^(u), for each child v 
of u. Thus \B\ < \Xu\ -I- s < tw{G) -I- 1 -I- s. Now we must prove that B is in 
fact a solution for G*“^(u). Let (s,t) be a pair in C(G*“^(m)) and P be a path 
in G*“^(u) from s to t. We need to show that there is a vertex of P in B. If 
there is a vertex of P in A„, then clearly B contains a vertex of P. If, on the 
other hand, P contains no vertex of A„, we must have all vertices of P in the 
same G*“^(u), for some child v of u, by Lemma 1. But B contains a solution for 
G*“^(u). Therefore, B contains a vertex of P. This completes the proof that B is 
a solution for G*“^(m), and so the optimum of G®“^(m) is at most s + tw{G) + 1. 

Now, let us proceed with the description of Get {u,A). If s < t, then 
opt{G{u)) < s -I- tw{G) -I- 1 < t -I- tw{G) + 1, and we can compute in poly- 
nomial time an optimal solution Aq for G{u) (by brute force in 
time, which is polynomial since t = [(tw(G) -I- l)/e]). If |Ao| < t then pro- 
ceed to the next vertex in postorder. If |Ao| > t, then we output u and the 
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set ^ = ^0 U Xu- Note that in fact opt{G^~^ {u)) = |Ao| > t, Xu C A and 
\A\ < opt{G^~^ (u)) + (tw{G) + 1) < opt{G^~^ (u)) + te < (1 + e)opt(G®“^(w)), 
as desired. On the other hand, if s > t, then t < s < opt{G'^~^ (u)) < s + 
tw{G) + 1 < s + tt < opt{G'‘~^ {u)) + opt{G^~^ {u))e = (1 + e)opt{G'‘~^{u)). 
Thus B (from the previous paragraph) is a solution for of size at most 

s + tw{G) + !<(! + t)opt{G'‘~^{u)) that can be computed in polynomial time. 
Moreover, C B. So in this case we output u and A = B. This finishes the 
description of Get {u,A). 



3 Edge Multicut 

In this section we show that Edge Multicut can be reduced to Unrestricted 
Vertex Multicut by a reduction that preserves approximability. 

The reduction has the following property. If the instance of Edge Multicut is 
a graph with bounded degree and bounded tree-width, then the corresponding 
instance of Unrestricted Vertex Multicut has bounded tree-width. 

Given a graph G = (U, E), the line graph of G is the graph whose vertex set 
is E and such that two of its vertices (edges of G) are adjacent if they share an 
endpoint in G. In other words, the line graph of G is the graph (E,L), where 
L = {e/ : e, f € E and e and / have a common endpoint}. 

Consider an instance of Edge Multicut, that is, a graph G = (U, E) and a set C 
of pairs of distinct vertices of G. Let us describe the corresponding instance of 
Unrestricted Vertex Multicut. The input graph for Unrestricted Vertex Multicut 
is the line graph of G, denoted by G'. Now let us describe the set of pairs of 
vertices of G'. For each pair (s,t) in C, we have in C all pairs (e, /) such that e 
has s as endpoint and / has t as endpoint. 

Clearly G' can be obtained from G in polynomial time. Note that C has at 
most kA^ pairs, where k = \C\ and A is the maximum degree of G. Also C can 
be obtained from G and C in polynomial time. 

The following theorem completes the reduction. 

Theorem 4. S is a solution for Edge Multicut in G if and only if S is a solution 
for Unrestricted Vertex Multicut in G' . 

Proof. Consider a solution S of Edge Multicut in G, that is, a set S of edges of 
G such that any pair in C is disconnected in (U, E — S). Note that S C E{G) = 
U(G'). Let us verify that the removal of S from G' disconnects all pairs in C . 
For any pair (e, /) in C , there are s and t in U (G) such that s is an endpoint 
of e, t is an endpoint of / and the pair (s,t) is in C. Moreover, a path P' in G' 
from e to / corresponds to a path P in G from s to t whose edges are a subset of 
the vertices in P' (which are edges of G) . Since S' is a solution of Edge Multicut 
in G, there must be an edge of P in S, which means that there is a vertex of P' 
in S. Hence S is a solution for Unrestricted Vertex Multicut in G' . 

Conversely, let S be a solution for Unrestricted Vertex Multicut in G', that 
is, S is a set of edges of G whose removal from G' disconnects all pairs of vertices 
of G' in C . Let (s, t) be a pair in C, and P a path in G from s to t. (Recall that. 
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by the description of Edge Multicut, s yf t.) Let e be the first edge of P and / 
the last one (possibly e=f). Clearly s is incident to e, and t to /. Thus (e, /) is a 
pair in Cb Corresponding to P, there is a path P' in G' from e to f containing 
as vertices all edges of P. Since S' is a solution for Unrestricted Vertex Multicut 
in G' and (e, /) is in C , S must contain a vertex of P' . Therefore there is an 
edge of P in S, which implies that S is a solution of Edge Multicut in G. ■ 

The next lemma shows the previously mentioned property of this reduction. 

Lemma 5. If G has bounded degree and bounded tree-width, then the line graph 
of G has bounded tree-width. 

Proof. Denote by G' the line graph of G. Let us present a tree decomposition 
of G' whose tree-width is at most (tw{G) -\- l)A, where A is the maximum degree 
of G. 

Let O = {T, {Xu)uev(T)) be a tree decomposition of G of width tw{G). For 
each u e V(T), let Vu be the set of edges of G incident to some vertex in Xu- 
First let us prove that G' = {T, {Yu)uev(T)) is a tree decomposition of G'. For 
this, given an edge e of G, denote by Eg the subgraph of T induced by those 
vertices in T for which Yu contains e. We shall prove that (1) any edge h of G' 
has both endpoints in Yu, for some u in V{T); and (2) that Eg is a tree for any 
edge e of G'. 

The endpoints of an edge h of G' are two edges e and / of G with a common 
endpoint, say, v. But v G Xu for some u G V(T). This implies that both e and 
/ belong to V„, proving (1). For (2), let e be a vertex of G', that is, an edge 
e = xy of G. For any u such that e G Yu, we must have that either x G Xu or 
y G Xu- Therefore Eg = E^, U E^. We know that the subgraphs and Ty of E 
are subtrees of E. Moreover, and Ty have a vertex in common, because both 
x and y belong to the same Xu, for some u G V (E). Hence Eg is a subtree of E. 
This completes the proof that O' is a tree decomposition of G'. 

To verify that the width of O' is at most (tw{G) -\- 1)A, just note that 
|V„|<|V„|Z\, for ally GU(E). ■ 

The next corollary is a consequence of the previous reduction and the PTAS 
given in Section 2.1. 

Lemma 6. There is a PTAS for Edge Multicut in bounded- degree graphs with 
bounded tree-width. 

In fact we know how to implement the PTAS given in Section 2.1, for Edge 
Multicut in bounded-degree trees, in time 0((n -|- A:) l’l/ejd‘^ri/^1+2)^ where n 
is the number of vertices of the tree, k is the number of (si,ti) pairs, d is the 
maximum degree of the tree and 1 -I- e is the desired approximation ratio of the 
algorithm. The size of the input is 0(n -\- k). We omit the description of this 
linear-time implementation in this extended abstract. 
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4 Complexity Results 

In this section, we examine the complexity of Edge, Vertex and Unrestricted 
Vertex Multicut. First we prove that Edge and Vertex Multicut are NP-hard in 
bounded-degree trees, while Unrestricted Vertex Multicut is NP-hard in series- 
parallel graphs of bounded degree. Second, we show that the Weighted Edge 
Multicut is Max SNP-hard in binary tree. Finally we prove that Edge, Vertex and 
Unrestricted Vertex Multicut are Max SNP-hard in walls (defined in Section 4) . 

Theorem 7. Edge Multicut in binary trees is NP-hard. 

Proof. The reduction is from 3-SAT, a well-known NP-complete problem [8]. 

Consider an instance ^ of 3-SAT, that is, a set of m clauses C\,C 2 t ■ ■ , Cm 
on n variables X\,X 2 , ■ ■ ■ , Xn, each clause with exactly three literals. 

Let us construct an instance of Edge Multicut: a binary tree T and a set of 
pairs of distinct vertices of T. The tree T is built as follows. For each variable 
Xi, there is a gadget as depicted in Figure 2 (a). The gadget consists of a binary 
tree with three vertices: the root and two leaves, one labeled Xi and the other 
labeled Xi. For each clause Cj, there is a gadget as depicted in Figure 2 (b). The 
gadget consists of a binary tree with five vertices: the root, one internal vertex 
and three leaves, each one labeled by one of the literals in Cj. 




Fig. 2. (a) The gadget for variable Xi. (b) The gadget for clause Cj = 
{xi,X2, 3:3}. (c) Tree T built for the instance <? = (a;i V X2 V 3:3) A (xi V 0:2 V 3:3), 
that is. Cl = {xi,X 2 tXz\ and C 2 = {xi,X 2 ,X 3 }. 



The tree T is built from these n -|- to gadgets by arbitrarily connecting them 
using new vertices to get a binary tree. See Figure 2 (c) for an example. 

Next, we give the set of pairs of vertices of T in our instance. For each variable 
Xi, there is a pair with the vertices labeled Xi and Xi in its gadget. For each clause 
Cj, there are two pairs: one formed by the two leaves that are siblings and the 
other formed by the last leaf and the internal vertex. Finally, each vertex labeled 
Xi in the gadget for a clause is paired to the vertex labeled Xi in the gadget for 
the variable Xi, where Xi S {xi,Xi}. This ends the construction of the instance 
for Edge Multicut. Note that all this can be done in polynomial time in the size 
of <P. 
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The next lemma completes the proof of Theorem 7. ■ 

Lemma 8. <P is satisfiable if an only if there is a solution for T of size exactly 
n + 2m. Moreover, we can construct in polynomial time such a solution for T 
from a truth assignment for and vice versa. 

Proof. Assume ^ is satisfiable. Let us present a solution S for T of size exactly 
n + 2m. The edge set S consists of two types of edges: 

1. For each variable Xi, S contains the edge in the gadget for Xi incident to the 
leaf labeled Xi if Xi = TRUE or to the leaf labeled Xi if Xi =FALSE. 

2. For each clause Cj, S contains two distinct edges in the gadget for Cj. These 
edges are such that (1) they disconnect the two pairs in the gadget, and (2) 
the only leaf that is still connected to the root of the gadget is a leaf with 
a label Xi e Cj such that Xi = TRUE. (The four possible choices for the two 
edges are shown in Figure 3.) 




r r 




Fig. 3. Possible choices of two edges, the dashed edges, in the gadget for a clause 
that leave exactly one leaf (the marked leaf v) connected to the root r. 



Clearly such set S has exactly n + 2m edges and can be constructed in 
polynomial time from <?. Let us prove that S is in fact a solution for T. It is 
easy to see that S disconnected the pairs for the variables, and the pairs for the 
clauses. The remaining pairs consist of two vertices labeled by a literal Xi, one in 
the variable gadget for Xi and the other in a clause gadget. If Xi = TRUE, then the 
edge in the variable gadget incident to the vertex labeled Xi is in S, guaranteeing 
that the pair is disconnected. If Xt = FA USE, then the vertex labeled Xi in the 
clause gadget is disconnected from the root of this gadget, and therefore, from 
the gadget for Xi. Thus S' is a solution for T, and it has exactly n + 2m edges. 

Let us prove the inverse implication. Assume there is a solution S for T 
with exactly n + 2m edges: one per variable and two per clause (one for each of 
the “disjoint” pairs). More specifically, S has exactly one edge in each variable 
gadget, and exactly two edges in each clause gadget in one of the configurations 
of Figure 3. Set Xi =TRUE if the edge of S in the gadget for Xi is incident to 
the vertex labeled xp, set Xi =FALSE otherwise. Clearly, we can determine this 
truth assignment in polynomial time. 
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For each clause Cj, there is exactly one leaf v in the gadget for Cj that is 
connected to the root r of the gadget. Let Xi G {xi,Xi} be the label for this leaf. 
There is a pair formed by this leaf v and the leaf in the gadget for Xi whose label 
is Xi- In S, there must be an edge e in the path between these two leaves. Since 
leaf V is connected to the root r of the gadget for Cj and all edges in S are either 
in a variable gadget or in a clause gadget, this edge e has to be in the variable 
gadget. This means e is the edge incident to the leaf labeled xi in the gadget 
for Xi- Hence Xi =TRUE, and the clause is satisfied. Since this holds for all the 
clauses, the given assignment makes TRUE, implying that <P is satisfiable. ■ 

Theorem 9. Vertex Multicut in trees with maximum degree at most four is 
NP-hard. 

We omit the proof. The construction is similar to the one used in Theorem 7. 

Theorem 10. Unrestricted Vertex Multicut in series-parallel graphs with max- 
imum degree at most three is NP-hard. 

We omit the proof. The construction is similar to the one used in Theorem 7. 

Theorem 11. Weighted Edge Multicut is Max SNP-hard in binary trees. 

Proof sketch. Let us reduce Edge Multicut in stars to Weighted Edge Multicut 
in binary trees. From an instance of the Unweighted Edge Multicut restricted 
to stars, we construct an instance of the Weighted Edge Multicut restricted to 
binary trees in the following way: for each leaf of the star S, there is a corre- 
sponding leaf in the binary tree T . The pairs are the same (we may assume there 
is no pair involving the root of the star) . We connect the leaves of T arbitrarily 
into a binary tree. The edges in T incident to the leaves get weight one and all 
other edges of T get weight 2n-\- 1, where n is the number of leaves in the star 
S (which is the same as the number of leaves in the tree T we construct) . Any 
solution within twice the optimum for the Weighted Edge Multicut instance we 
constructed will contain only edges of T incident to the leaves, since any other 
edge is too heavy (removing all edges incident to the leaves, we get a solution of 
weight n). Then it is easy to see that any optimal solution for the Weighted Edge 
Multicut instance we constructed corresponds to an optimal solution for the orig- 
inal Unweighted Multicut star instance, and vice versa. Also approximability is 
preserved by this reduction. ■ 

A wall of height h consists of ft- -I- 1 vertex disjoint paths i?o, . . . , Rh^ which 
we call rows, and ft -|- 1 vertex disjoint paths Lq, . . . , Lh, which we call columns. 
A wall of height six is depicted in Figure 4 (a). The reader should be able to 
complete the definition by considering Figure 4 (a). The formal definition is as 
follows. Each row is a path of 2ft -|- 2 vertices. Each column, a path with 2ft -|- 2 
vertices. Column r contains the (2r -|- 1)®* and the (2r -I- 2)”®* vertices of all rows, 
as well as the edge between them. For i < h and even, each Lr contains an edge 
between the (2r -|- 2)”'^ vertex of Ri and the (2r -|- 2)"'^ vertex of Ri+i. For i < h 
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(b) 



Xj X2 Xj Xj X2 Xj 



Fig. 4. (a) A wall of height six. The dark edges indicate row Ri and column L^- 
(b) The three last rows of the wall built from ^ = (xi V 1 e2 V X 3 ,){x\ V X 2 V a; 3 ). 



and odd, each contains an edge between the (2r + 1)^* vertex of Ri and the 
(2r + 1)^* vertex of Ri+\. These are all the edges of the wall. 

We prove that Edge, Vertex and Unrestricted Vertex Multicut are Max SNP- 
hard in walls. This means, by Arora et al. [1], that there is a constant e > 0 
such that the existence of a polynomial-time approximation algorithm for any 
of the three versions of Multicut with performance ratio at most 1 -I- e implies 
that P=NP. 

As in [18], we use the concept of L-reduction, which is a special kind of 
reduction that preserves approximability. 

Let A and B be two optimization problems. We say A L-reduces to B if there 
are two polynomial-time algorithms / and g, and positive constants a and /?, 
such that for each instance / of A, 

1. Algorithm / produces an instance /' = /(/) of B, such that the optima 
of / and of costs denoted OptA{I) and OptB(I') respectively, satisfy 
Optsil') < a ■ OptA{I), and 

2. Given any feasible solution of /' with cost c', algorithm g produces a solution 
of / with cost c such that |c — OptA{I)\ < /? • |c' — OptB(I')\- 



Theorem 12. Edge, Vertex and Unrestricted Vertex Multicut are Max SNP- 
hard in walls. 

Proof sketch. The reduction is from the well-known Max SNP-hard problem 
MAX 3-SAT [18]. We show the reduction for Unrestricted Vertex Multicut. The 
other two reductions are similar. 

The first part of the L-reduction is the polynomial-time algorithm / and 
the constant a. Given any instance ‘P of MAX 3-SAT, / produces an instance 
W,C of Unrestricted Vertex Multicut such that lU is a wall. Also, the cost of 
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the optimum of W,C in Unrestricted Vertex Multicut, denoted Optj^(y{W,C), 
is at most a times the cost of the optimum of in MAX 3-SAT, denoted by 
OptsATi^)i i-6-> Opt]^fQ(W,C) < a ■ Optgj^j,(^). 

Consider an instance of MAX 3-SAT, that is, a collection of m clauses on n 
variables xi, . . . , a;„, each consisting of exactly three literals. Let us describe the 
corresponding instance for Unrestricted Vertex Multicut. The wall lU is a wall 
of height 6m. To describe the collection C of pairs of vertices of W, consider the 
last row of W partitioned into m same length paths, each one associated to one 
of the clauses of <P. Each path has length 12. Label the 2"'^, 6*^ and 10*^ vertices 
in the path each with one of the literals in the clause. See Figure 4 (b) 
for an example. For each pair of vertices u, v in W, u labeled Xi and v labeled 
Xi, include into C the pair u,v. For each clause, include three pairs. The three 
pairs formed by each two of the vertices labeled by its three literals. This ends 
the description of the instance of Unrestricted Vertex Multicut. 

First note that W and C can be obtained in polynomial time in the size of 
<P. 

Lemma 13. Optj^Q{W,C) < 6 • Opt g j^j,{<P) . 

Proof sketch. W,C clearly has a solution of size 3m. Also Optgj^rp{(P) > 
m/2. ■ 

Lemma 14. From a solution to of size s, 0 < s < m, we can obtain a solution 
to W, C of size 3m — s and vice versa. 

Proof sketch. Given an assignment that satisfies s clauses of F, let S be the 
set of all labeled vertices of W except one labeled vertex per satisfied clause. 
Choose to not include in S' a vertex labeled by a literal that is assigned TRUE. 
One can verify that this set S is a solution for W, C of size 3m — s. 

Now, consider a solution S for W, C of size 3m — s. Since W has height 6m, 
there is a row Ri of W which has no vertex of S. Set to TRUE any literal which 
appears as a label of a vertex of W that is connected to Ri after the removal of 
S. If some variable was not assigned a value by this rule, assign it an arbitrary 
value. Note that, since vertices labeled Xi are not connected to vertices labeled Xi 
after the removal of S', the assignment is well-defined. Consider the six columns 
of the wall corresponding to the clause of F. S should contain at least two 
vertices in these columns, otherwise there would be a path connecting at least 
two of the labeled vertices in these columns. This means that at least s clauses 
have only two vertices removed from their columns of W . Thus one of the labeled 
vertices is connected to row Ri, meaning that this clause is satisfied. ■ 

The previous two lemmas can be used in an obvious way to show the reduction 
we presented is an L-reduction. ■ 
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Abstract. The edge and vertex-disjoint path problems together with 
their unsplittable flow generalization are NP-hard problems with a multi- 
tude of applications in areas such as routing, scheduling and bin packing. 
Given the hardness of the problems, we study polynomial-time approxi- 
mation algorithms with bounded performance guarantees. We introduce 
techniques which yield new algorithms for a wide range of disjoint-path 
problems. We use two basic techniques. First, we propose simple greedy 
algorithms for edge- and vertex-disjoint paths and second, we propose the 
use of a framework based on paeking integer programs for more general 
problems such as unsplittable flow. As part of our tools we develop im- 
proved approximation algorithms for a class of packing integer programs, 
a result that we believe is of independent interest. 



1 Introduction 

This paper examines approximation algorithms for disjoint-path problems and 
their generalizations. In the edge( vertex) -disjoint path problem, we are given a 
graph G = (V, E) and a set T of connection requests, also called commodities. 
Every connection request in T is a vertex pair (si, tf), 1 < i < K. The objective 
is to connect a maximum number of the pairs via edge( vertexj-disjoint paths. 
For the vertex-disjoint paths problem, the connection requests are assumed to be 
disjoint. We call the set of connected pairs realizable. A generalization of the edge- 
disjoint paths problem is multiple- source unsplittable flow. In this problem every 
commodity k in the set T has an associated demand pk, and every edge e has a 
capacity Ue. The demand pk must be routed on a single path from Sk to tk. The 
objective is to maximize the sum of the demands that can be fully routed while 
respecting the capacity constraints. Wlog, we assume that max^ pk = 1, and 
following the standard definition of the problem in the literature, Ug > 1, Ve G 
E. When all demands and capacities are 1 in the multiple-source unsplittable 
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flow problem we obtain the edge-disjoint path problem. (See [10,14] for further 
applications and motivation for unsplittable flow.) In all the above problems 
one can assign a weight Wi < 1 to each connection request and seek to And a 
realizable set of maximum total weight. In this paper we will state explicitly 
when we deal with the weighted version of a problem. 

Both the edge- and vertex-disjoint path problems are fundamental, exten- 
sively studied (see e.g. [26,6,27,21,10,13,3]), NP-hard problems [9], with a mul- 
titude of applications in areas such as telecommunications, VLSI and schedul- 
ing. Despite the attention they have received, disjoint-path problems on general 
graphs remain notoriously hard in terms of approximation; even for edge-disjoint 
paths, no algorithm is known which can And even an uj{\/ ^J\E\) fraction of the 
realizable paths. 

In approximating these problems, we use the traditional notion of a p- approxi- 
mation algorithm, p > 1, which is one that outputs, in polynomial time, a real- 
izable set of size at least 1/p times the optimum. We will also give and refer to 
algorithms which output a realizable set whose size is a non-linear function of 
the optimum OPT, such as OPT"^ /\E\. 

Overview of Previous Work. Two main approaches have been followed for ap- 
proximation. 

(i) The first approach, which we call the rounding approach, consists of solving a 
fractional relaxation and then use rounding techniques to obtain an integral solu- 
tion. The fractional relaxation is typically multicommodity flow and the rounding 
techniques used to date involved sophisticated and non-standard use of random- 
ized rounding [31]. The objective value of the resulting solution is compared to 
the fractional optimum y* , which is an upper bound on the integral optimum, 
OPT. This approach has been the more successful one and recently yielded the 
first approximation algorithm for uniform unsplittable flow [31] which is the spe- 
cial case of unsplittable flow where all the capacities have the same value. Let d 
denote the dilation of the fractional solution, i.e. the maximum length of a flow 
path in the fractional relaxation. Bounds that rely on the dilation are particu- 
larly appealing for expander graphs where it is known that d = 0{polylog{n)) 
[16,12]. The rounding approach yields, for unweighted uniform unsplittable flow 
(and thus for unweighted edge-disjoint paths as well) a realizable set of size 
0{m.asi{{y*fl /\E\,y* / ^J\E\,y* /d}) and an n{m.asi{{y*fl /\E\,y* / d}) bound for 
the weighted version [31] . This approach is known to have limitations, e.g. 
it is known that a gap of Q{^J\V\) exists between the fractional and integral 
optima for both the edge- and vertex-disjoint path problems on a graph with 
\E\ = e{\V\) [7]. 

(ii) Under the second approach, which we call the routing approach, a commodity 
is never split, i.e. routed fractionally along more than one path during the course 
of the algorithm. In the analysis, the objective value of the solution is compared 
to an estimated upper bound on the OPT. This approach has found very limited 
applicability so far, one reason being the perceived hardness of deriving upper 
bounds on OPT without resorting to a fractional relaxation. The only example 
of this method we are aware of is the on-line Bounded Greedy Algorithm in 
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[10] whose approximation guarantee depends also on the diameter of the graph. 
The algorithm can be easily modified into an off-line procedure that outputs 
realizable sets of size fI{OPT / ^\E\) {fI{OPT / ^\V\)) for edge( vertex)-disjoint 
paths. The fi{OPT / ^J\V\) bound is the best known bound to date for vertex- 
disjoint paths. 



Table 1. Known approximation bounds for edge-disjoint paths (EDP), uniform 
capacity unsplittable flow (UCUFP), and general unsplittable flow (UFP), 17- 
notation omitted. Eg denotes the set of edges used by some path in an integral 
optimal solution and do the average length of the paths in the same solution. 
Results with no citation come from the present paper. Our y* / ^J\E\ bound for 
the weighted EDP problem holds under the assumption that the number of 
connection requests K = 0{\E\). 
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Our Contribution. In this paper we provide techniques for approximating disjoint- 
path problems that bear on both of the above approaches. Tables 1 and 2 sum- 
marize previous and new bounds for edge-, vertex-disjoint path and unsplittable 
flow problems. 

Under the routing approach (approach (ii)) we give a simple deterministic 
greedy algorithm Greedy _Path for edge-disjoint paths that has performance 
guarantees comparable to those obtained by the multicommodity flow based 
algorithms. Greedy algorithms have been extensively studied in combinatorial 
optimization due to their elegance and simplicity. Our work provides another 
example of the usefulness of the greedy method. The underlying idea is that if 
one keeps routing commodities along sufficiently short paths the final number of 
commodities routed is lowerbounded with respect to the optimum. 

Greedy _Path outputs a realizable set of size f7(max{OPT^/|ifo|, 

OPT / ^J\EoW) for the edge-disjoint path problem. Here Eg Q E is the set of 
edges used by the paths in an optimal solution. Note that OPT"^ /\Eo\ always 
dominates OPT / ^J\Eo\ in the unweighted case that we consider; we give both 
bounds to facilitate comparison with existing work and to conform to the tradi- 
tional notion of a /3-approximation algorithm. Our approximation existentially 
improves upon the multicommodity-flow based results when \Eo\ = o(|if|), i.e. 
when the optimal solution uses a small portion of the edges of the graph. An- 
other bound can be obtained by noticing that OPT'^ /\Eo\ = OPT /do, where do 
denotes the average length of the paths in an optimal solution. 
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Essentially the same algorithm, Greedy _VPath, obtains for the vertex- 
disjoint path problem a realizable set of size Q{th&-k{OPT'^ /\V o\,OPT / ^J\VoW), 
where Vo Q V \s the set of vertices used by the paths in an optimal solution. 
Recall that the best known bound to date is t = f2{OPT/ The realizable 

set output by our algorithm has size and potentially better than this 

when \Vo\ = o(|y|). This is a significant improvement when OPT = UJ{^J\V\). 
For example, when OPT = f2(|E|), we obtain a constant-factor approximation. 
Again an Q{OPT / do) guarantee follows immediately. 



Table 2. Known approximation bounds for vertex-disjoint paths, 17-notation 
omitted. Vo denotes the set of vertices used by some path in an integral optimal 
solution and do the average length of the paths in the same solution. Results 
with no citation come from the present paper. 
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We turn to the rounding approach (approach (i)) to handle the weighted dis- 
joint path and unsplittable flow problems. We propose the use of packing integer 
programs as a unifying framework that abstracts away the need for customized 
and complex randomized rounding schemes. A packing integer program is of the 
form maximize • x, subject to Ax < b, A,b,c> 0. We first develop, as part 
of our tools, an improved approximation algorithm for a class of packing integer 
programs, called column restricted, that are relevant to unsplittable flow prob- 
lems. Armed with both this new algorithm and existing algorithms for general 
packing integer programs, we show how packing formulations both provide a 
unified and simplified derivation of many results from [31] and lead to new ones. 
In particular, we obtain the first approximation algorithm for weighted multiple- 
source unsplittable flow on networks with arbitrary demands and capacities and 
the first approximation algorithm for weighted vertex-disjoint paths. Further, we 
believe that our new algorithm for column-restricted packing integer programs 
is of independent interest. We now elaborate on our results under the rounding 
approach, providing further background as necessary. 



1.1 Packing Integer Programs 

Packing integer programs are a well-studied class of integer programs that can 
model several NP-complete problems, including independent set, hypergraph 
fc-matching [19,1], job-shop scheduling [23,28,33,20] and many flow and path 
related problems. Many of these problems seem to be difficult to approximate, 
and not much is known about their worst-case approximation ratios. Following 
[30] a packing integer program (PIP) is defined as follows. 
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Definition 1. Given A e [0, [> g [l,oo)’” and c G [0, 1]” with max^ cj = 
1, a PIP V = (A, 5, c) seeks to maximize (F ■ x subject to x G Z” and Ax < b. 
Constraints of the form 0 < Xj < dj are also allowed. If A G {0, each 
entry of b is assumed integral. Let B = and a be the maximum number 

of non-zero entries in any column of A. 

The parameters B and a in the definition above appear in the approximation 
bounds. For convenience we call bt the capacity of row i. The restrictions on the 
values of the entries of A, b, c are wlog; the values in an arbitrary packing program 
can be scaled to satisfy the above requirements [29] . We will state explicitly when 
some packing program in this paper deviates from these requirements. When 
A G {0, we say that we have a (0, 1)-PIP. 

Previous Work on Packing Programs. The basic techniques for approximat- 
ing packing integer programs have been the randomized rounding technique of 
Raghavan and Thompson [24,25] and the work of Plotkin, Shmoys and Tar- 
dos [23]. Let y* denote the optimum value of the linear relaxation. Standard 
randomized rounding yields integral solutions of value 12{y* for general 

pip’s and n{y* for (0, l)-PIP’s [25] (see also [29].) Srinivasan [29,30] 
improved on the standard randomized rounding bounds and obtained bounds of 
fl{y*{y* and Q{y* / for general PIP’s and f2{y*{y* 
and f2{y* for (0, l)-PIP’s. 

New Results for Column- Restricted PIP’s. The above results show that for vari- 
ous combinations of values for y*, to and B, the bounds obtained for a (0, 1)-PIP 
are significantly better than those for general PIP’s. In fact they are always better 
when y* < to. As another example, the approximation ratio obtained 

for a (0, 1)-PIP is polynomially better than the approximation ratio of a PIP with 
the same parameters. Thus it is natural to ask whether we can bridge this gap. 
We make progress in this direction by defining a column-restrictedPIP Vr as one 
where all non-zero entries of the j-th column of A have the same value pj < 1. 
Column-restricted PIP’s arise in applications such as unsplittable flow problems 
(see next section) . We show how to obtain approximation guarantees for column- 
restricted PIP’s that are similar to the ones obtained for (0, l)-PIP’s. Let y* de- 
note the optimum of the linear relaxation of Vr. We obtain an integral solution 
of value C(y*/TO^/('®+^)) and fl{y* j cA ! ^~) . Letting cr(y*) = f2{yAy* we 

also obtain a bound that is at least as good as cr(y*) for y* < to log n and in any 
case it is never worse by more than a 0(log^^^ n) factor. Finally we show how to 
improve upon the stated approximations when maxj pj is bounded away from 1 . 
We develop the latter two results in a more complete version of this paper [15]. 

We now give an overview of our technique. First we find an optimum solution 
X* to the linear relaxation of the column-restricted PIP Vr . We partition the pj ’s 
into a fixed number of intervals according to their values and generate a packing 
subproblem for each range. In a packing subproblem V^ corresponding to range 
L, we only include the columns of A with pj G L and to each component of 
the 6-^-vector we allocate only a fraction of the original bi value, a fraction 
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that is determined based on information from x*. Next we find approximate 
solutions to each subproblem and combine them to obtain a solution to the 
original problem. Perhaps the key idea is in using the solution x* to define the 
capacity allocation to the 6 ^-vector for subproblem This generalizes previous 
work of the authors [14] on single-source unsplittable flow. The other key idea 
is that each subproblem can be approximated almost as well as a (0, 1)-PIP. 

1.2 Applications of Packing to Approximation 

We introduce a new framework for applying packing techniques to disjoint-path 
problems. First, we formulate an integer program (which is not necessarily a 
PIP) and solve a linear relaxation of this integer program to obtain a solution 
X. Typically this is a multicommodity flow problem. We then explicitly use the 
solution X to guide the formation of a column-restricted or (0, 1) PIP. A related 
usage of a solution to the linear relaxation of integer programs in a different con- 
text can be found in [8,32]. An integral approximate solution to the created PIP 
will be an approximate solution to the original disjoint path problem (with pos- 
sibly some small degradation in the approximation factor) . This integral solution 
can be found using existing algorithms for approximating PIP’s as a black box. 
Our algorithms apply to the case when there are weights on the commodities, 
and thus generalize those of Srinivasan for edge-disjoint paths. This approach 
yields four applications which we explain below. 

Application 1: Weighted Unsplittable Flow. Let Fi,F 2 ,Fs denote (y*)'^ /\E\,y* /d 
and y*/ a/|^ respectively. We obtain a realizable set of weight l 7 (max{F 3 / log \E\, 
Fi/log^ \E\, F 2 }) for unsplittable flow with arbitrary demands and capacities. In 
fact we can give a better Ff-type bound for small enough y*, whose analytical 
form is complicated. See [15] for further details. In the case where the number of 
commodities K = 0{\E\) we show how to obtain also an I2(max{Fi/ log |F|, F 3 }) 
bound. Notice that for the edge-disjoint path problem this is a natural assump- 
tion since at most |F| connection requests can be feasibly routed. We also note 
that a /9-approximation for y* entails an 0(plog |F|) approximation for the prob- 
lem of routing in rounds [2,10]. We do not pursue any further the latter problem 
in this extended abstract. 

Application 2: Weighted Vertex- Disjoint Paths. We give an algorithm that out- 
puts a solution of value fi{Ta&yi{{y*Y /\V\,y* / ^J\V\,y* / d}). The algorithm re- 
lies on the observation that, after solving a fractional relaxation, the problem 
of rounding is essentially an instance of hypergraph matching; thus it can be 
formulated as a packing program with \ V\ constraints. The algorithm is surpris- 
ingly simple but the performance guarantee matches the integrality gap known 
for the problem [7]. 

Application 3: Routing with Low Congestion. A problem that has received a 
lot of attention in the literature on routing problems (e.g. [25,17,23,22,10,14]) is 
that of minimizing congestion, i.e. the factor by which one is allowed to scale 
up capacities in order to achieve an optimal (or near-optimal) realizable set. In 
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our usage of packing in the rounding algorithms we have assumed that the pa- 
rameter B of the packing program is equal to 1 . Allowing B > 1 is equivalent to 
allowing congestion B in the corresponding disjoint-path problem. Thus another 
advantage of the packing approach is that tradeoffs with the allowed congestion 
B can be obtained immediately by plugging in B in the packing algorithms that 
we use as a black box. For example the approximation for edge-disjoint paths 
becomes f2{ma,x{y*{y* /\E\ log \E\)^/^ ,y* when the num- 
ber of connection requests is 0(|if |). Our congestion tradeoffs generalize previous 
work by Srinivasan [31] who showed the fl{y* /d}/^) tradeoff for uniform capac- 
ity unsplittable flow. We do not state the tradeoffs explicitly for the various 
problems since they can be obtained easily by simple modifications to the given 
algorithms. 

Application 4- Independent Set in the Square of a Graph. Given a graph G = 
(V,E) the k-th power G^ = {V, E^) of G is a, graph where two vertices are adja- 
cent if and only if they are at distance at most k in G. We further demonstrate 
the power of packing formulations by providing an 0{^J\V\) approximation algo- 
rithm for finding a maximum independent set in the square of a graph. We also 
give results that depend on the maximum vertex degree A in G. Our approxi- 
mation ratio cannot be polynomially improved in the sense that no 
approximation, for any fixed e > 0, can be obtained in polynomial time unless 
NP = ZPP. Studying NP-hard problems in powers of graphs is a topic that has 
received some attention in the literature [5,34,18,4]. 

Independently of our work, Baveja and Srinivasan (personal communication) 
have obtained results similar to ours for approximating vertex-disjoint paths 
under the rounding approach, unsplittable flow and column-restricted packing 
integer programs. Their work builds on the methods in [31]. 

2 Approximating a Column-Restricted PIP 

In this section we present the approximation algorithm for column-restricted 
pip’s. Let V = (A,b,c) be a column-restricted PIP. We call pj < 1, the value 
of the non-zero entries of the j-th column, 1 < j < n, the value of column j. 
Throughout this section we assume that there is a polynomial-time algorithm 
that given a (0, 1)-PIP with fractional optimum y* outputs an integral solution 
of value at least a{m, B, a, y*) where m, B, a are the parameters of the packing 
program. For example a known cr is 12{y* We start by providing a sub- 
routine for solving a column-restricted PIP when the column values are close in 
range. 

Theorem 1. Let V = (A, 6, c) he a column-restricted PIP where all column 
values pj are equal to p and where each hi = kiP p, P > 1, h positive integer, 
1 < i < m. Here min^ 6^ is not necessarily greater than 1. Then we can find in 
polynomial time a solution of value at least a{m,kP,a,y*), where y* denotes the 
optimum of the linear relaxation ofV and k = miniki. 
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Proof. Transform the given system "P to a (0, 1)-PIP V' = {A\ b', c) where 6' = 
kiP, and AG = Aij/p. Every feasible solution (either fractional or integral) x to 
V' is a feasible solution to V and vice versa. Therefore the fractional optimum y* 
is the same for both programs. Also the maximum number of non-zero entries on 
any column is the same for A and A! . Thus we can unambiguously use a for both. 
We have assumed that there is an approximation algorithm for V' returning a 
solution with objective value cr(m, /cT, a,y*). Invoking this algorithm completes 
the proof. □ 

The proof of the following lemma generalizes that of Lemma 4.1 in [11]. 

Lemma 1. Let V = (A, 6, c), be a column-restricted PIP with column values in 
the interval ( 01 , 02 ], and bi > T 02 , Vz and some number T > 1. Here miiiibi 
is not necessarily greater than 1. There is an algorithm a_PACKiNG that finds a 
solution g to V of value at least a{m,P,a, ■^y*), where y* is the optimum of 
the fractional relaxation ofV. The algorithm runs in polynomial time. 

Proof sketch. We sketch the algorithm a_PACKiNG. Obtain a PIP V' = (A', 6', c) 
from V as follows. Round down bi to the nearest multiple of P 02 and then 
multiply it with ai/ 02 . Set 6' equal to the resulting value. Every 6' is now 
between oi/2o2 and 01/02 times the corresponding bi. Set AG to oi if Ay 0 
and to 0 otherwise. V' has thus a fractional solution of value at least (oi/2o2)y* 
that can be obtained by scaling down the optimal fractional solution of V . Note 
that every is a multiple of Pai. Thus we can invoke Theorem 1 and find 
a solution g' to V' of value at least a(jn, P,a, {ai/2a2)y*). Scaling up every 
component of g' by a factor of at most 02/01 yields a vector g that is feasible 
for V and has value at least cr(m, T, a, ^y*). □ 

Lemma 2. Let V = (A, 6, c), be a column-restricted PIP with column values 
in the interval ( 01 , 02 ], and bi a multiple of Pa 2 , Vz and some number P > 1. 
Here mini bi is not necessarily greater than 1 . There is an algorithm Inter- 
val_Packing that finds a solution g to V of value at least a{m, P,a, ^y*), 
where y* is the optimum of the fractional relaxation ofV. The algorithm runs 
in polynomial time. 

Proof sketch. Similar to that of Lemma 1. Since all bfs are already multiples of 
T 02 , we don’t pay the 1/2 factor for the initial rounding. □ 

We now give the idea behind the full algorithm. The technique generalizes 
earlier work of the authors on single-source unsplittable flow [14]. Let x * denote 
the optimum solution to the linear relaxation of P. We are going to create packing 
subproblems P^ = (A^,b^,c^) where A^ contains only the columns of A with 
values in some fixed range (oa-IjCTa]- We will obtain our integral solution to 
V by combining approximate solutions to the subproblems. The crucial step is 
capacity allocation to subproblems. Consider a candidate for the 6'^-vector that 
we call the A-th fractional capacity vector. In the fractional capacity vector the 
z-th entry is equal to X)j|p 3 G(a>_i ax] other words we allocate capacity 

to the A-th subproblem by “pulling” out of b the amount of capacity used up 
in the solution x * by the columns with values in (oa-i, caa]- The benefit of such 
a scheme would be that by using the relevant entries of x * we could obtain a 
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feasible solution to the linear relaxation of P^. However, to be able to benefit 
from Lemma 1 to find an approximate integral solution to each P^, we need all 
entries of to be larger than Ba\. This increases the required capacity for each 
subproblem potentially above the fractional capacity. Thus we resort to more 
sophisticated capacity allocation to ensure that (i) there is enough total capacity 
in the 6-vector of P to share among the subproblems (ii) the subproblems can 
benefit from the subroutines in Lemmata 1, 2. In particular, we initially assign 
to each subproblem A only 1/2 of the fractional capacity vector; this has the 
asymptotically negligible effect of scaling down the fractional optimum for each 
subproblem by at most 2. We exploit the unused (1/2)6 vector of capacity to 
add an extra Ba\ units to the entries of 6^, VA. 

Given Oj, aj, let be the set of column indices k for which ai < pk < aj. 

We then define, to be the m x submatrix of A consisting of the 

columns in and for any vector x, to be the | [-entry subvector 

X consisting of the entries whose indices are in We will also need to 

combine back together the various subvectors, and define U • • • 

to be the n-entry vector in which the various entries in the various subvectors 
are mapped back into their original positions. Any positions not in any of the 
corresponding index sets are set to 0. 



ALGORITHM COLUMN_PARTITIOn(P) 

step 1. Find the n- vector a;* that yields the optimal solution to the linear relaxation 
of P. 

Step 2a. Define a partition of the (0, 1] interval into ^ = 0(log n) consecutive subinter- 



vals (0, n 



.( 4 - 



A — A+1 



(4 ,4 ], (4 ,1] where k is a, constant larger 



than 1. For A = 1 ... ^ — 1 form subproblem = (A^ , b^,c^). and are the 



restrictions defined by A^ = 

= (A^,6^,c^) such that = A°’’ 






and c“ = c’ 

lA 



Define similarly 



Step 2b. Let di be the ith entry of (A^ • a;/). We define = b when A is 1 or G and 
otherwise 6/ = 13(1/4)^“^ -I- (l/2)d/. 

Step 3. Form solution x by setting Xj to 1 if pj € (0,n“^], and 0 otherwise. 

Step 4 . On each P^, 2 < A < ^ — 1, invoke O-Packing to obtain a solution vector 
x^. Combine the solutions to subproblems 2 through ^ — 1 to form n- vector x = 
UaxA^e-i*"^. 

Step 5. Invoke Interval.Packing on P^ to obtain a solution vector x^. Let x = Ua;^. 
Step 6. Of the three vectors x, x and x, output the one, call it x, that maximizes 



Two tasks remain. First, we must show that the vector x output by the 
algorithm is a feasible solution to the original packing problem P . Second, we 
must lower bound ■ x in terms of the optimum ■ a;* of the fractional 

relaxation of P. Let , and if ri = (1/4)'^ and r 2 = (1/4)'^“^ 

then we abbreviate as y^. We examine first the vector x. 

Lemma 3. Algorithm Column_Partition runs in polynomial time and the n- 
vector X it outputs is a feasible solution to P of value at least X)a =2 ^ B,a, 
{l/16)yf). 

Proof sketch. Let y^ be the optimal solution to the linear relaxation of P^. By 
Lemma 1 the value of the solution x^, 2 < A < ■J”!) found at Step 4 is at 
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least cr(m, a, (l/2)(l/4)y'^). By the definition of in P\, (l/2)a;^ ,i-e- the 
restriction of x* scaled by 1/2, is a feasible fractional solution for P\. Thus 

> (l/2)y*- Hence the value of is at least a{m, B,a,{l/16)y^). The claim 
on the value follows. 

For the feasibility, we note that the aggregate capacity used by x on row i of 
A is the sum of the capacities used by 2 < A < ^ — 1, on each subproblem. 
This sum is by Step 2b at most (1/2) ^ 

H ^^” 2 ( 1 / 4 )'^”^ < (1/2)5 < (l/2)6i and X)a= 2 ~^ ^ Thus the aggregate 

capacity used by x is at most (l/2)&i + (1/2)6^ = bi. □ 

It remains to account for x and x. The following theorem is the main result 
of this section. 

Theorem 2. Let V = {A, b, c) be a column-restricted PIP and y* he the op- 
timum of the linear relaxation of V. Algorithm Column_Partition finds in 
polynomial time a solution g to V of value f^{inax{y* , 
y*{y*/mlogny/^}). 

Proof. Each of the three vectors x, x, x solves a packing problem with column 
values lying in (0,1/n^], (l/n^,l/4] and (1/4,1] respectively. Let 'P,V,V be 
the three induced packing problems. The optimal solution to the linear relax- 
ation of at least one of them will have value at least 1/3 of the optimal so- 
lution to the linear relaxation oi V. It remains to lower bound the approxi- 
mation achieved by each of the three vectors on its corresponding domain of 
column values. Since m, B, and a are fixed for all subproblems, note that cr 
is a function of one variable, y*. Vector x solves P optimally. The solution is 
feasible since all column values are less than 1/n^ and thus the value of the 
left-hand side of any packing constraint cannot exceed 1. By Lemma 1, vector 
X outputs a solution to V of value at least a{m,B,a, {1 / A)yV . For V, the 
value of the solution output is given by the sum A = X)a =2 B, a, (l/16)y/‘) 
in Lemma 3. We distinguish two cases. If ct is a function linear in y* then 
A > cr(m, 5, a, (l/16)j/» ’ ). If ct is a function convex in y* the sum A is 

minimized when all the terms yf are equal to 6>(y” ’ /logn). Instantiating 

CT(y») with the function I7(max{y*/m^/^'®'''^^, y*/a^/^}) in the linear case and 
with 12(y*(y*/m)^/^) in the convex case completes the proof. □ 

3 Applications of PIP’s to Approximation 

3.1 Weighted Multiple- Source Unsplittable Flow 

Our approach for weighted unsplittable flow consists of finding first the optimum 
of the fractional relaxation, i.e. weighted multicommodity flow, which can be 
solved in polynomial time via linear programming. The relaxation consists of 
allowing commodity k to be shipped along more than one path. Call these paths 
the fractional paths. We round in two stages. In the first stage we select at most 
one of the fractional paths for each commodity, at the expense of congestion. 
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i.e. some capacities may be violated. In addition, some commodities may not 
be routed at all. In the second stage, among the commodities routed during the 
first stage, we select those that will ultimately be routed while respecting the 
capacity constraints. It is in this last stage that a column-restricted PIP is used. 

We introduce some terminology before giving the algorithm. A routing is 
a set of Ski~tki paths Pk^, used to route pki amount of flow from Ski to tk^ 
for each (ski,tki) G I Q T. Given a routing g, the flow ge through edge e is 
equal to ^p.^g Pi- A routing g for which ge < Ue for every edge e is an 
unsplittable flow. A fractional routing is one where for commodity k (i) the flow 
is split along potentially many paths (ii) demand fk < pk is routed. A fractional 
routing corresponds thus to standard multicommodity flow. A fractional single- 
path routing is one where the flow for a commodity is shipped on one path if at 
all, but only a fraction fk < Pk of the demand is shipped for commodity k. The 
value of a routing g is the weighted sum of the demands routed in g. 

ALGORITHM MAX_R0UTING(G = (V,E,u),T) 

Step 1. Find an optimum fractional routing / by invoking a weighted multicommodity 
flow algorithm. Denote by (ff) the value of /. 

Step 2. Scale up all capacities by a factor of 0(log|£l|) to obtain network G' with 
capacity function u' . Invoke Raghavan’s algorithm [24] on G' to round / to a 
routing g' . 

Step 3. Scale down the flow on every path of g' by a factor of at most 0(log jSj) to 
obtain a fractional single-path routing g that is feasible for G. 

Step 4- Construct a column-restricted PIP V = {A, b, c) as follows. Let k\,k 2 , . . . ,k\ 
be the set of commodities shipped in g, \ < K. A has A columns, one for each 
commodity in g, and \E\ rows, one for each edge of G. Aij = pk^ if the path Pk^ 
in g for commodity kj goes through edge d and 0 otherwise. The cost vector c 
has entry Cj set to Wk^Pk^ for each commodity kj shipped in g. Finally, bi = u^i, 
l<i< \E\. 

Step 5. Invoke algorithm Column.Partition to find an integral solution g to V. 
Construct an unsplittable flow g" by routing commodity kj on path Pk^ if and 
only if gj = 1. 



Theorem 3. Given a weighted multiple- source unsplittable flow problem (G = 
(V,E),T), algorithm Max_Routing finds in polynomial time an unsplittable 
flow of value at least f7(max{a-^(T)/(log {T)fl / {\E\ log^ |^'|)})> 

where (T) is the value of an optimum fractional routing f. 

Proof sketch. First note that since routing g' is feasible for G', after scaling down 
the flow at Step 3, routing g is feasible for G. The rounded solution to V main- 
tains feasibility. For the value, we can easily extend the analysis of Raghavan’s 
algorithm to show that even with weights, it routes in G' a constant fraction 
of a^(T) [24]. Set Zk^ to be equal to the amount of flow that is routed in g for 
commodity kj divided by pk^ . The A- vector z is a feasible fractional solution to 
V of value l7(a^(T)/ log |if|). The theorem follows by using Theorem 2 to lower 
bound the value of the integral solution g to V. □ 

We now give a bound that depends on the dilation d of the fractional routing. 
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Theorem 4. Given a weighted multiple- source unsplittable flow problem {G = 
(y,i?),T), there is a polynomial-time algorithm that finds an unsplittable flow 
of value at least Q{a^ {T)/d), where is the value and d the dilation of an 

optimum fractional routing f. 

The construction in the proof of Theorem 4 can also be used to give an 
f2(max{a-^(r)/(A/|^), log |£1|)}), bound in the case where the 

number of commodities |T| = 0{\E\). We omit the details in this version. 



3.2 Weighted Vertex-Disjoint Paths 

A relaxation of the problem is integral multicommodity flow where every com- 
modity has an associated demand of 1. We can express vertex-disjoint paths 
as an integer program X that has the same constraints as multicommodity flow 
together with additional “bandwidth” constraints on the vertices such that the 
total flow through a vertex is at most 1. Let CV be the linear relaxation of 
X. The optimal solution / to CP consists of a set of fractional flow paths. Our 
algorithm relies on the observation that / gives rise (and at the same time is a 
fractional solution) to a PIP. The particular PIP models a I-matching problem 
on a hypergraph iJ with vertex set V and a hyperedge (subset of V) for every 
path in the fractional solution. In other words, the paths in / may be viewed 
as sets of vertices without any regard for the flow through the edges of G. We 
proceed to give the full algorithm. 

ALGORITHM PATH_PACKING(G = (V,E),T) 

Step 1. Formulate the linear relaxation CP and find an optimal solution / to it. Using 
flow decomposition express / as a set of paths Pi, P 2 , . . ■ , P\ each connecting a pair 
of terminals and carrying Zi < 1, 1 < i < X, units of flow. 

Step 2. Construct a (0, 1)-PIP V = {A,b,c) as follows. A is a \V\ x A matrix; Aij is 
1 if path Pj includes vertex i and 0 otherwise. 6 is a vector of ones. Cj is equal to 
Wk such that path Pj connects terminals Sk and tk- 
Step 3. Find an integral solution g to V and output the corresponding set of paths 

P{g)- 



Theorem 5. Given a weighted vertex- disjoint paths problem (G = (V,E),T), 
algorithm Path_Packing finds in polynomial time a solution of value 
n{max{y*/i/\V\,{y*fl/\V\,y*/d}), where d is the dilation and y* is the value 
of an optimum solution to the linear relaxation CP. 

Proof sketch. We show first that P{g) is a feasible solution to the problem. 
Clearly the constraints of the packing program V, constructed at Step 2 ensure 
that the paths in P{g) are vertex disjoint. This excludes the possibility that 
more than one (sfe,tfe)-path is present in P{g) for some k. The optimal value of 
the linear relaxation of V is at least y* since setting xj equal to zj, 1 < j < A, 
yields a feasible fractional solution to V. By applying either standard randomized 
rounding [25] or Srinivasan’s algorithms [29,30] at Step 3, we obtain the claimed 
bounds on the objective value ■ g. □ 
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3.3 An Application to Independent Set 

In this section we show how a packing formulation leads to an 0{^J\V\) approx- 
imation for the following problem: find a maximum weight independent set in 
the square of the graph G = {V,E). 

Theorem 6. Given a graph G = (V, E) and c G [0, a weight vector on the 
vertices, there exists a polynomial-time algorithm that outputs an independent set 
in the square = (V, E"^) of G of weight I7(max{?/*/ a/|F|, /\V\,y* / A}). 

Here y* denotes the optimum of a fractional relaxation and A is the maximum 
vertex degree in G. 

A hardness of approximation result for the problem of finding a maximum 
independent set in the k-t\i power of a graph follows. 

Theorem 7. For the problem of finding a maximum independent set in the k-th 
power G^ = (V, E’^) of a graph G = {V, E) for any fixed integer k > 0, there is no 
p- approximation with p = for any fixed e > 0, unless NP = ZPP. 

4 Greedy Algorithms for Disjoint Paths 

In this section we turn to the routing approach for the unweighted edge- and 
vertex-disjoint path problems. 

ALGORITHM GREEDY_PATH(G, T) 

Step 1. Set A to 0. 

Step 2. Let be the commodity in T such that the shortest path P* in G from 

s* to t* has minimum length. If no such path exists halt and output A. 

Step 3. Add P* to A and remove the edges of P* from G. Remove {s*,t*) from T. 
Goto Step 2. 

We begin by lowerbounding the approximation ratio of the algorithm. 
Theorem 8. Algorithm Greedy _Path runs in polynomial time and outputs 
a solution to an edge-disjoint paths problem (G = (V,E),T) of size at least 
^/{\/\^\ + 1) times the optimum, where Eo Q E is the set of edges used by the 
paths in an optimal solution. 

We now give improved bounds on the size of the output realizable set. 

Theorem 9. Algorithm Greedy _Path outputs a solution to an edge-disjoint 
path problem (G = (V,E),T) of size G{OPT^ /\Eo\), where Eg Q E is the set of 
edges used by the paths in an optimal solution. 

Proof. Let t be the total number of iterations of Greedy _Path and Ai be the set 
A at the end of the z-th iteration. Let O be an optimal set of paths. We say that a 
path Px hits a path Py, if P^ and Py share an edge. We define the set OqA as the 
paths in O that correspond to commodities not routed in A. Let Pi be the path 
added to A at the z-th iteration of the algorithm. If Pi hits ki paths in OoAi that 
are not hit by a path in Ai-i, then Pi must have length at least ki. In turn each 
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of the paths hit has length at least ki otherwise it would have been selected by 
the algorithm instead of Pi. Furthermore all paths in O are edge-disjoint with 
total number of edges \Eo\. Therefore — \^o\- Applying the Cauchy- 

Schwarz inequality on the left-hand side we obtain that (X)i=i /t < \Eo\. But 
X)i=i ki = \0 © At I since upon termination of the algorithm all paths in O eAt 
must have been hit by some path in At - We obtain < \Eo\. Wlog we can 

assume that \ At \ = o{\0\), since otherwise Greedy_Path obtains a constant- 
factor approximation. It follows that t = f}{\0\^)/\Eo\) = Q{OPT'^ /\Eo\). □ 

Corollary 1. Algorithm Greedy_Path outputs a solution to an edge-disjoint 
path problem (G = (V,E),T) of size Q{OPT / do), where do is the average length 
of the paths in an optimal solution. 

Algorithm Greedy _Path gives a solution to vertex-disjoint paths with the fol- 
lowing modification at Step 3: remove the vertices of P* from G. Call the resulting 
algorithm Greedy _VPath. The analogues of the results above can be seen to 
hold for Greedy _VPath as well. 

Theorem 10. Greedy _VPath outputs a solution to a vertex- disjoint 

path problem (G = {V,E),T) of size G{m.a,x{OPT/\Vo\,OPT‘^ /\Vo\,OPT/do}), 
where Vo ffV is the set of vertices used by the paths in an optimal solution and 
do is the average length of the paths in an optimal solution. 
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Abstract. The mixed postman problem, a generalization of the Chinese 
postman problem, is that of finding a shortest tour that traverses each 
edge of a given mixed graph (a graph containing both undirected and 
directed edges) at least once. The problem is solvable in polynomial time 
either if the graph is undirected or if the graph is directed, but NP-hard 
in mixed graphs. An approximation algorithm with a performance ratio 
of 3/2 for the postman problem on mixed graphs is presented. 



1 Introduction 

Problems of finding paths and tours on graphs are of fundamental importance 
and find many practical applications. The Traveling salesman problem (TSP) 
is a well-known and widely studied problem. The objective is to find a shortest 
tour that visits all vertices of a given graph exactly once. The problem is known 
to be NP-hard. Postman problems are similar to TSP at first glance, but are 
quite different in terms of the complexity of the problems. Given a graph G, 
the Chinese postman problem (CPP) is to find a minimum cost tour covering 
all edges of G at least once [9]. It is the optimization version of the Euler tour 
problem, which asks if there is a tour that traverses every edges of a graph 
exactly once. Edmonds and Johnson [3] showed that the problem is solvable in 
polynomial time. They also showed that the problem is solvable in polynomial 
time if G is a directed graph. 

The Mixed Postman Problem (MPP) is a generalization of the Chinese 
Postman Problem and is listed as Problem ND25 by Garey and Johnson [8]. In 
the mixed postman problem, the input graph may contain both undirected edges 
and arcs (directed edges). The objective is to find a tour that traverses every 
edge at least once, and which traverses directed edges only in the direction of the 
arc. Even though both undirected and directed versions of the Chinese postman 
problem are polynomially solvable, Papadimitriou [13] showed that MPP is NP- 
hard. There are other related problems, such as the Rural postman problem 
and the Windy postman problem, which are also NP-hard [4,5]. Many practical 
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applications like mail delivery, snow removal, and trash pick-up can be modeled 
as instances of MPP, and hence it is important to design good approximation 
algorithms for this problem. 

Previous Work: Numerous articles have appeared in the literature over 
the past three decades about the mixed postman problem. Edmonds and John- 
son [3], and Christofides [2] presented the first approximation algorithms. Fred- 
erickson [7] showed that the algorithm of [3] finds a tour whose length is at 
most 2 times the length of an optimal tour (i.e., approximation ratio of 2). 
He also presented a mixed strategy algorithm, which used the solutions output 
by two different heuristics, and then selected the shorter of the two tours. He 
proved that the approximation ratio of the mixed strategy algorithm is |. Com- 
prehensive surveys are available on postman problems [1,4]. Integer and linear 
programming formulations of postman problems have generated a lot of interest 
in recent years [10,12,16]. Ralphs [16] showed that a linear relaxation of MPP 
has optimal solutions that are half-integral. One could use this to derive a 2- 
approximation algorithm for the problem. Improvements in implementation are 
discussed in [12,15]. It is interesting to note that Nobert and Picard [12] state 
that their implementation has been used for scheduling snow removal in Mon- 
treal. Several other articles have appeared on generalized postman problems, 
such as /c-CPP [14] and the Windy Postman problem [4]. 

Our Results: Even though numerous articles have appeared in the litera- 
ture on MPP after Frederickson’s paper in 1979, his result has been the best 
approximation algorithm for MPP in terms of proven worst-case ratio until now. 
In this paper, we present an improved approximation algorithm for MPP with an 
approximation ratio of | . We study the properties of feasible solutions to MPP, 
and derive a new lower bound on the cost of an optimal solution. Our algorithm 
uses a subtle modification of Frederickson’s algorithm, and the improved perfor- 
mance ratio is derived from the new lower bound. We present examples showing 
that our analysis is tight. 



2 Preliminaries 

Problem Statement: The input graph G = (V, E, A) consists of a set of 
vertices V, a multi-set of edges E, and a multi-set of arcs (directed edges) A. 
A non-negative cost function C is defined on edges and arcs. We extend the 
definition of C to graphs (multisets of edges and arcs), by taking the sum of the 
costs of its edges and arcs. We assume that the graph is strongly connected, i.e., 
there exists a path from any vertex u to any other vertex v, since the problem 
is clearly infeasible for graphs that are not strongly connected. The output is 
a tour which may travel each edge or arc several times. Therefore, except for 
traversing each edge/arc once, the traversal could always use a shortest path 
between any two nodes. Hence we assume that the weights of the edges satisfy 
the triangle inequality. 

Definitions: A cut is a partition of the vertex set into S and P — S'. It is 
called nontrivial if neither side is empty. An edge crosses the cut if it connects 
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a vertex in S' to a vertex in — S. Let outdegree{v) be the number of outgoing 
arcs from v. Similarly, indegree{v) is the number of incoming arcs into v. Let 
degreeiv) be the total number of edges and arcs incident to v. We say v has even 
degree if degree{v) is even. A graph has even degree if all its vertices have even 
degree. Let surplus{v) = outdegree{v) — indegree{v) . If surplus{v) is negative, 
we may call it as a deficit. The definition of surplus can be extended to sets of 
vertices S, by finding the difference between outgoing and incoming edges that 
cross the cut {S,V — S). 

Properties of Eulerian Graphs: A graph is called Eulerian if there is a 
tour that traverses each edge of the graph exactly once. It is known that an 
undirected, connected graph is Eulerian if and only if the degree of each vertex 
is even. For a directed graph to be Eulerian, the underlying graph must be 
connected and for each vertex v, outdegree{v) = indegree{v). In other words, 
for each vertex v, surplus{v) = 0. A mixed graph G = (V,E,A) is Eulerian if 
the graph is strongly connected and satisfies the following properties [6,11]: 

— Even Degree Condition: Every vertex v is incident to an even number 
of edges and arcs, i.e., degree{v) is even. This condition implies that the 
number of edges and arcs crossing any cut (S', V — S) is even. 

— Balanced Set Condition: For every nontrivial cut S C V, the absolute 
surplus of S must be less than or equal to the number of undirected edges 
crossing the cut (S, E — S). 

The above conditions can be checked in polynomial time using algorithms for 
the maximum flow problem [6] . In other words, we can decide in polynomial time 
whether a given mixed graph is Eulerian. The problem we are interested in is to 
find set of additional edges and arcs of total minimum cost that can be added 
to G to make it Eulerian, and this problem is NP-hard. In the process of solving 
mixed postman problem, arcs and edges may be duplicated. For convenience, we 
may also orient some undirected edges by giving them a direction. The output 
of our algorithm is an Eulerian graph El that contains the input graph G as a 
subgraph. So each edge of El can be classified either as an original edge or as 
a duplicated edge. Also, each arc of H is either an original arc, or a duplicated 
arc, or an oriented edge, or a duplicated & oriented edge. 

3 Prederickson’s Mixed Algorithm 

Frederickson defined the following algorithms as part of his solution to MPP: 

— Evendegree: augment a mixed graph G by duplicating edges and arcs 
such that the resulting graph has even degree. A minimum-cost solution 
is obtained by disregarding the directions of the arcs (i.e., by taking the 
underlying undirected graph) and solving CPP by adding a minimum-weight 
matching of odd-degree nodes. 

— Inoutdegree: augment a mixed graph G by duplicating edges and arcs, 
and orienting edges such that in the resulting graph, for each vertex v, 
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surplus{v) = 0. We will refer to this as the Inout problem. A minimum-cost 
solution Gio is obtained by formulating a flow problem and solving it op- 
timally. The augmentation cost, Cio{G) is defined as the cost of additional 
arcs and edges that were added to G to get G/o by Inoutdegree. 

— Evenparity: applied to the output of Inoutdegree on an even-degree 
graph; restores even degree to all nodes without increasing the cost, while 
retaining the property that indegree = outdegree at all nodes. Edmonds and 
Johnson [3] indicated that Inoutdegree can be applied to an even-degree 
graph in such a way that the resulting graph has even degree and hence 
Eulerian. Frederickson [7] showed a simple linear-time algorithm to perform 
the task. The basis of this algorithm is that a suitably defined subgraph of 
undirected edges and duplicated edges/arcs forms a collection of Eulerian 
graphs. 

— Largecycles: similar to Evendegree except that only edges are allowed 
to be duplicated, and arcs are not considered. 




Fig. 1. Mixed algorithm 



Figure 1 shows an approximation algorithm for MPP presented by Frederick- 
son [7], called the Mixed algorithm. It comprises of two heuristics called MixedI 
and Mixed2. Heuristic MixedI first runs Evendegree to make the degree of 
all nodes even. Then it runs Inoutdegree to make indegree = outdegree for 
all vertices. Finally, Evenparity restores even degree to all the nodes without 
increasing the cost, and the graph becomes Eulerian. Heuristic Mixed2 first 
calls Inoutdegree, and then makes the graph Eulerian by calling Largecy- 
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CLES. Since Largecycles disregards the arcs of the graph, no further steps are 
needed. The Mixed algorithm outputs the best solution of the two heuristics, 
and Frederickson showed its performance ratio is at most 

4 Structure of Inout Problem 

In this section, we will identify a few critical properties of the Inout problem, 
which we use improve the Mixed algorithm. Let G be a mixed graph, and let 
Gb and Gr (blue graph and red graph respectively) be two augmentations of G 
for the Inout problem, i.e., Gb and Gr are two Inout solutions. 

Let Gbr = Gb (B Gr denote the symmetric difference of Gb and Gr, the 
multigraph containing edges and arcs in one graph, but not the other. Since we 
are dealing with multigraphs, an edge/arc (u,v), occurs in Gbr as many times 
as the difference between the number of copies of (u,v) in Gb and Gr- A set of 
arcs G in Gbr is an alternating cycle if the underlying edges of G form a simple 
cycle, and in any walk around the cycle, the blue arcs of G are directed one way, 
and the red arcs of G are directed in the opposite direction (see Figure 2). 



Blue arcs 

^ Red arcs 



Fig. 2. An alternating cycle 




The following lemmas show how alternating cycles of Gbr can be used to 
switch between different Inout solutions. 

Lemma 1. The arcs of Gbr can be decomposed into alternating cycles. 

Proof. This proof is based on the properties of Eulerian directed graphs. Since 
they are Inout solutions, each vertex of Gb and Gr satisfies the condition, 
injdegree = outjdegree. However, this rule does not apply to Gbr, since all com- 
mon arcs and edges have been removed. Since only common arcs are removed, it 
can be verified that for each vertex in Gbr, surplusb = surplusr- In other words, 
at each vertex of Gbr, the net surplus or deficit created by blue arcs is equal to 
the net surplus or deficit created by red arcs. So at each vertex, an incoming 
blue arc can be associated with either an outgoing blue arc, or an incoming red 
arc. Similar statement is valid for red arcs also. 

Consider a directed walk starting at an arbitrary vertex in Gbr, in which 
blue edges are traversed in the forward direction and red edges are traversed 
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in the reverse direction. Whenever any vertex is revisited during the walk, an 
alternating cycle is removed and output. When the start vertex is re-visited 
and there are no more arcs incident to it, restart the walk from another vertex. 
Continue the process until all arcs of Gbr have been decomposed into alternating 
cycles. An alternative proof can be given by showing that reversing all red arcs 
makes Gbr satisfy indegree = outdegree at each node and hence the arcs can be 
decomposed into cycles. 

Each alternating cycle AC in Gbr is composed of one or more blue paths and 
red paths. In each cycle, all blue paths go in one direction and all red paths go 
in the opposite direction. Let us call blue arcs of AC as B and red arcs as R. 

Define the operation of removing a set of arcs B from an Inout solution 
Gio, denoted by Gjo — B, as follows. If an arc of B is an additional arc or edge 
added during the augmentation, then it can be removed. On the other hand, if 
the arc was an undirected edge in G that was oriented during the augmentation, 
then we remove the orientation but leave the edge in the graph. The cost of such 
an operation is the total cost of the arcs/edges that are removed. Similarly, we 
can define the addition operation G/o + B, where a set of arcs B are added to 

Gio- 

Lemma 2. Consider two Inout solutions Gb and Gr, and their symmetric dif- 
ference Gbr- Let AC he an alternating cycle in Gbr, with blue component B and 
red component R. Then, Gb — B R and Gr — R-\- B are also Inout solutions. 

Proof. Clearly, adding or deleting edges of AC does not affect the deficit/surplus 
of the nodes that are not in AC. For all nodes in AC, the net deficit/surplus 
created by blue paths of AC is same as the net deficit/surplus created by red 
paths of AC. So, the net deficit/surplus created by removal of B from Gb is 
compensated by the addition of R. Therefore, Gb-B-\-R is an Inout solution, as 
each node of Gb-B-\-Rhas net deficit/surplus of zero. By symmetry, Gr-R-\-B 
is also an Inout solution. 



Lemma 3. Let d~{R) be the cost of removing R from Gr and let d'^{R) be the 
cost of adding R to Gb. Similarly, let d~{B) he the cost of removing B from 
Gb and let d'^{B) he the cost of adding B to Gr. Then d~^{R) < d~{R) and 
d+{B) < d~{B). 

Proof. We prove that the cost of adding R to Gb does not exceed the cost of 
removing R from Gr. Arcs of R in Gr can be either additional arc, oriented 
edge, or additional oriented edge. Additional arcs and additional oriented edges 
contribute at most the same cost to d~^ (R) . If an oriented edge is in R, then either 
it is not currently oriented in Gb or it is oriented in the opposite direction in Gb. 
To add this oriented edge to Gb, we can either orient the undirected edge, or 
remove its opposite orientation; either way there is no additional cost. Therefore 
R can be added to Gb without incurring additional cost. Hence, d+(i?) < d~{R). 
By symmetry, d~^{B) < d~{B). 
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5 Improved Lower Bound for Mixed Postman Problem 

Consider a mixed graph G = {V,E,A), with edges E and arcs A. Let C* be 
the weight of an optimal postman tour of G. Suppose, given G as input, In- 
OUTDEGREE Outputs G/o = (P) U, M). U C E are the edges of G that were not 
oriented by Inoutdegree. M A A are arcs that satisfy indegree = outdegree 
at each vertex. Let Cm and Cu be the total weight of M and U respectively. 
Note that Cm + Cu = Cio{G) + C{G). M may contain several disjoint directed 
components. Suppose we shrink each directed component of G/o into a single 
vertex and form an undirected graph UG. If we run Evendegree on [7G, it 
adds a minimum-weight matching (i.e., a subset of U) to make the degree of 
each vertex in [7G even. Let the weight of this matching be Cx- Frederickson [7] 
used a lower bound of Cm + Cu on C* . We show the following improved lower 
bound on C* , which allows us to improve the approximation ratio. 

Lemma 4. Cm + Cu + Cx < C* . 

In order to prove the above lemma, we first show that when we find an 
optimal Inout solution, if some undirected edges are not oriented (edges in the 
set U) by Inoutdegree, then adding additional copies of edges in U (when one 
runs Evendegree in UG) to G does not decrease G/o(G), the augmentation 
cost of Inoutdegree. 

Lemma 5. Let G = (E, E, A) be a mixed graph. Let Gio = {V, U, M) he an opti- 
mal Inout solution computed by Inoutdegree algorithm. Let U = {ui, ...,Uk} 
be the undirected edges of G that were not oriented by Inoutdegree, even 
though the algorithm could orient these edges without incurring additional cost. 
Adding additional copies of edges in U to G does not decrease the augmentation 
cost of an optimal Inout solution. 

Proof. We give a proof by contradiction. Let H be the graph G to which Ui copies 
of Mi {i = 1 , ... , k ) are added such that the augmentation cost of Inoutdegree 
is less for H than G, i.e., Cio{H) < G/o(G). In addition, let H he a, minimal 
supergraph of G with this property, and if we add any fewer copies of the edges 
of U, then Cio{H) = G/o(G). Clearly, some edges of U must be oriented in 
Hio, otherwise an Inout solution of H would also be an Inout solution of G. 
Assigning G/ = G/o and Gr = Hio, we will prove that d~{R) < d~{B) for 
every cycle AC that uses at least one edge of U. Recall that, d~{R) is the cost of 
removing R from the red Inout solution Hjo, and d~{B) is the cost of removing 
B from the blue Inout solution G/o- 

Consider any alternating cycle AC that contains an edge of U. Let B and R 
be its blue and red arcs respectively. By Lemma 3, d~^{B) < d~{B). If d~{R) > 
d~{B) then, 

C{Hio -R + B)= C{Hio) - d~{R) + d+{B) < C{Hio). 

Therefore, Hjo — R-\- B is an optimal Inout solution of H, and it uses fewer 
copies of edges of U than H . This contradicts the minimality assumption of 
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H. Therefore d~{R) < d~{B) for all cycles AC that involve at least one edge 
from U. Choose any such cycle AC. We already know that d~^{R) < d~{R) and 
d~{R) < d~{B). Combining these, we get, 

C{Gio -B + R)= C{Cio) - d~{B) + d+{R) < C(G/o)- 

Note that G/o — B + R uses at most one copy of each edge from U and its cost 
is less than C{Gio). In other words, G/o — i? + i? is an Inout solution of G, 
and C{Gio — B + R) < G(G/o)- This contradicts the assumption that G/o is 
an optimal Inout solution of G. Therefore, adding additional copies of edges 
from U does not decrease the augmentation cost of an Inout solution. 



Proof of Lemma 4- Let G* be an optimal solution, whose cost is G*. Consider 
the nodes of UG, the nodes corresponding to the directed components of M. G* 
needs to have additional matching edges (subset of U) between these components 
to satisfy the even-degree condition for each component. This matching costs at 
least Cx- By Lemma 5, we know that additional edges of U do not decrease the 
cost of optimal Inout solution. Therefore the augmentation cost of the Inout 
problem is still G/o(G). Hence the total cost of G* is at least Cx + Cio{G) + 
C(G). Substituting Cm + Cu for G/o(G) -k G(G), we get, Cm + Cu + Cx <C*. 

6 Modified Mixed Algorithm 

Figure 3 describes the modified Mixed algorithm. First, we run algorithm In- 
OUTDEGREE On input graph G and obtain G/o = {V,U,M). Before running 
Evendegree of Mixed 1 algorithm, reset the weights of all arcs and edges used 
by M to 0, forcing Evendegree to duplicate edges/arcs of M whenever possi- 
ble, as opposed to duplicating edges of U. Use the actual weights for the rest of 
the Mixed 1 algorithm. There are no changes made in Mixed2 algorithm. 
Remark: The cost of arcs and edges of M need not be set to zero. In practice, 
the cost of each edge in U could be scaled up by a big constant. This ensures that 
the total cost of edges from U in the the minimum-cost matching is minimized. 

6.1 Analysis of Modified Mixed Algorithm 

Lemma 6. Let Cm be the cost of arcs in M and let C* he the cost of an optimal 
postman tour ofG. The cost of the tour generated by Modified MixedI algorithm 
is at most C* + Cm- 

Proof. Consider the components induced by the arcs of M. In original graph 
G, the arcs of M correspond to arcs and oriented edges, possibly duplicated. 
Since the original edges and arcs of G corresponding to M are made to have 
zero-cost when Evendegree is run, any component that has an even number of 
odd-degree vertices can be matched using the arcs of M at zero cost. Therefore, 
the algorithm adds a minimum-cost matching of cost Cx- Let H be the graph 
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Fig. 3. Modified Mixed algorithm 



at this stage. Note that Evendegree duplicates each arc of M at most once to 
form H. We follow Frederickson’s analysis [7] for the rest of the proof: Let Mi 
be the multiset of arcs such that there are two arcs in Mi for each arc in M . 
Clearly, M and Mi both satisfy Inout property. Hence, the union of U, X, and 
Ml forms a Inout solution containing H, whose cost is Cu + Cx + 2 * Cm- Since 
Inoutdegree is an optimal algorithm for Inout problem, it is guaranteed to 
find an Inout solution of cost at most Cu + Cx + 2 * Cm- This is at most 
C* + Cm by Lemma 4. Finally, Evenparity does not change the cost of the 
solution. 



Lemma 7 (Frederickson [7]). Algorithm Mixed2 finds a tour whose cost is 
at most 2C* — Cm- 



Theorem 1. Algorithm Modified Mixed produces a tour whose cost is at most 

I 

Proof. By Lemma 6, Modified Mixed 1 outputs a solution whose cost is at most 
Cm + C*, which is at most | C*, if Cm < C* j 2. On the other hand, if Cm > 
C*/2, then by Lemma 7, Mixed2 outputs a solution whose cost is at most 
2C* — Cm, which is at most | C* . 
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7 Conclusion 

We have presented an algorithm and analysis to achieve approximation ratio of 
3/2 for the mixed postman problem. Improvement in the performance ratio is 
achieved by proving an improved lower bound on the cost of an optimal postman 
tour. The performance ratio is tight as shown by Frederickson’s examples. 
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Abstract. We consider the uncapacitated facility location problem. In 
this problem, there is a set of locations at which facilities can be built; 
a fixed cost fi is incurred if a facility is opened at location i. Further- 
more, there is a set of demand locations to be serviced by the opened 
facilities; if the demand location j is assigned to a facility at location i, 
then there is an associated service cost of aj. The objective is to de- 
termine which facilities to open and an assignment of demand points to 
the opened facilities, so as to minimize the total cost. We assume that 
the service costs dj are symmetric and satisfy the triangle inequality. 
For this problem we obtain a (1 -|- 2/e)-approximation algorithm, where 
1 -b 2/e « 1.736, which is a significant improvement on the previously 
known approximation guarantees. 

The algorithm works by rounding an optimal fractional solution to a 
linear programming relaxation. Our techniques use properties of opti- 
mal solutions to the linear program, randomized rounding, as well as a 
generalization of the decomposition techniques of Shmoys, Tardos, and 
Aardal. 



1 Introduction 

The study of the location of facilities to serve clients at minimum cost has been 
one of the most studied themes in the field of Operations Research (see, e.g., the 
textbook edited by Mirchandani and Francis [9]). In this paper, we focus on one 
of its simplest variants, the uncapacitated facility location problem, also known 
as the simple plant location problem, which has been extensively treated in the 
literature (see the chapter by Cornuejols, Nemhauser, and Wolsey in [9]). This 
problem can be described as follows. There is a set of potential facility locations 
J-; building a facility at location i G !F has an associated nonnegative fixed cost 
fi, and any open facility can provide an unlimited amount of certain commodity. 
There also is a set of clients or demand points T> that require service; client j G T> 
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by ONR grant N00014-96-1-00500. 
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has a positive demand of commodity dj that must be shipped from one of the 
open facilities. If a facility at location i G !F is used to satisfy the demand of client 
j G T>, the service or transportation cost incurred is proportional to the distance 
from i to j, Cij . The goal is to determine a subset of the set of potential facility 
locations at which to open facilities and an assignment of clients to these facilities 
so as to minimize the overall total cost, that is, the fixed costs of opening the 
facilities plus the total service cost. We will only consider the metric variant of the 
problem in which the distance function c is nonnegative, symmetric and satisfies 
the triangle inequality. Throughout the paper, a p- approximation algorithm is 
a polynomial-time algorithm that is guaranteed to deliver a feasible solution of 
objective function value within a factor of p of optimum. The main result of 
this paper is a (1 -I- 2/e)-approximation algorithm for the metric uncapacitated 
facility location problem, where 1 -I- 2/e « 1.736. 

Notice that our result is based on worst case analysis, that is, our solu- 
tions will be within a factor of (1 -I- 2/e) of optimum for any instance of the 
problem. Such a strong assurance can have salient practical implications: these 
algorithms often outperform algorithms whose design was not grounded by the 
mathematical understanding required for proving performance guarantees. We 
have corroborated this assertion for our algorithm through a few computational 
experiments, which will be reported in a follow-up paper. 

In contrast to the uncapacitated facility location problem, Cornuejols, Fisher, 
and Nemhauser [3] studied the problem in which the objective is to maximize 
the difference between assignment and facility costs. They showed that with this 
objective, the problem can be thought of as a bank account location problem. 
Notice that even though these two problems are equivalent from the point of 
view of optimization, they are not from the point of view of approximation. 
Interestingly, Cornuejols, Fisher, and Nemhauser showed that for the maximiza- 
tion problem, a greedy procedure that iteratively tries to open the facility that 
most improves the objective function yields a solution of value within a constant 
factor of optimum. 

The metric uncapacitated facility location problem is known to be NP-hard 
(see [4]). Very recently, Guha and Khuller [7] have shown that it is Max SNP- 
hard. In fact, they have also shown that the existence of a p-approximation 
algorithm for p < 1.463 implies that NP C TIME(n‘^^^°®^°®"^) (see also Feige 

[ 5 ]). 

We briefly review previous work on approximation algorithms for the metric 
uncapacitated facility location problem. The first constant factor approximation 
algorithm was given by Shmoys, Tardos, and Aardal [11], who presented a 3.16- 
approximation algorithm, based on rounding an optimal solution of a classical 
linear programming relaxation for the problem. This bound was subsequently 
improved by Guha and Khuller [7], who provided a 2.408-approximation algo- 
rithm. Guha and Khuller’s algorithm requires a stronger linear programming 
relaxation. They add to the relaxation of [11] a facility budget constraint that 
bounds the total fractional facility cost by the optimal facility cost. After run- 
ning the algorithms of [11], they use a greedy procedure (as in [3]) to improve the 




182 



Fabian A. Chudak 



quality of the solution: iteratively, open one facility at a time if it improves the 
cost of the solution. Since they can only guess the optimal facility cost to within 
a factor of (1 + e), they are in fact solving a weakly polynomial number of linear 
programs. In contrast, the 1.736-approximation algorithm presented in this pa- 
per requires the solution of just one linear program, providing as a by-product 
further evidence of the strength of this linear programming relaxation. 

Without loss of generality we shall assume that the sets T and T> are disjoint; 
let Af = IF U I?, n = |A/”|. Even though all our results hold for the case of 
arbitrary demands, for sake of simplicity of the exposition, we will assume that 
each demand dj is 1 {j e V); thus, the cost of assigning a client j to an open 
facility at location i is Cy. The distance between any two points k,£ G J\f is Ckt- 
We assume that the nxn distance matrix {cm) is nonnegative, symmetric {cm = 
Cik, for all k,i G Af) and satisfies the triangle inequality, that is, Cy < Cik + Ckj, 
for all i,j, k G Af. The simplest linear programming relaxation (due to Balinski, 
1965 [2]), which we will refer to as P, is as follows: 

(P) subject to = 1 foi' 6^ch j G T> (1) 

Xij < Hi for each i G T, j GV (2) 

Xij > 0 for each i G T, j G V . (3) 

Any 0-1 feasible solution corresponds to a feasible solution to the uncapacitated 
facility location problem: yi = 1 indicates that a facility at location i G iF is 
open, while Xij = 1 means that client j G T> is serviced by the facility built 
at location i G T. Inequalities (1) state that each demand point j G V must 
be assigned to some facility, while inequalities (2) say that clients can only be 
assigned to open facilities. Thus the linear program P is indeed a relaxation of the 
problem. Given a feasible fractional solution {x,y), we will say that fijji 

and respectively, its fractional facility and service cost. 

Given a feasible solution to the linear programming relaxation P, the algo- 
rithm of Shmoys, Tardos, and Aardal first partitions the demand points into 
clusters and then for each cluster opens exactly one facility, which services all 
the points in it. In their analysis, they show that the resulting solution has the 
property that the total facility cost is within a constant factor of the fractional 
facility cost and the total service cost is within a constant factor of the fractional 
service cost. The main drawback of this approach is that most of the time the 
solution is unbalanced, in the sense that the first constant is approximately three 
times smaller than the second. 

One of the simplest ways to round an optimal solution {x*,y*) to the lin- 
ear program P is to use the randomized rounding technique of Raghavan and 
Thompson [10] as proposed by Sviridenko [12] for the special case in which all 
the distances are 1 or 2. Essentially, the 1.2785-approximation algorithm of [12] 
opens a facility at location i G T with probability y*-, and then assigns each 
demand point to its nearest facility. This matches the lower bound of Guha and 
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Khuller for this special case. Independently of our work, Ageev and Sviridenko 
[1] have recently shown that the randomized rounding analysis for the maximum 
satisfiability problem of Goemans and Williamson [6] can be adapted to obtain 
improved bounds for the maximization version of the problem. 

The following simple ideas enable us to develop a rounding procedure for the 
linear programming relaxation P with an improved performance guarantee. We 
explicitly exploit optimality conditions of the linear program, and in particular, 
we use properties of the optimal dual solution and complementary slackness. A 
key element to our improvement is the use of randomized rounding in conjunction 
with the approach of Shmoys, Tardos, and Aardal. To understand our approach, 
suppose that for each location z G iF, independently, we open a facility at z with 
probability y*. The difficulty arises when attempting to estimate the expected 
service cost: the distance from a given demand point to the closest open facility 
might be too large. However, we could always use the routing of the algorithm 
of Shmoys, Tardos, and Aardal if we knew that each cluster has a facility open. 
In essence, rather than opening each facility independently with probability y*, 
we instead open one facility in each cluster with probability y*. The algorithm 
is not much more complicated, but the most refined analysis of it is not quite so 
simple. Our algorithms are randomized, and can be easily derandomized using 
the method of conditional expectations. The main result of this paper is the 
following. 

Theorem 1. There is a polynomial-time algorithm that rounds an optimal so- 
lution to the linear programming relaxation P to a feasible integer solution whose 
value is within (1 + 2/e) « 1.736 of the optimal value of the linear programming 
relaxation P. 

Since the optimal LP value is a lower bound on the integer optimal value, the 
theorem yields a 1.736-approximation algorithm, whose running time is domi- 
nated by the time required to solve the linear programming relaxation P. As a 
consequence of the theorem, we obtain the following corollary on the quality of 
the value of the linear programming relaxation. 

Corollary 1. The optimal value of the linear programming relaxation P is within 
a factor of 1.736 of the optimal cost. 

This improves on the previously best known factor of 3.16 presented in [11]. 

2 A Simple 4-Approximation Algorithm 

In this section we present a new simple 4-approximation algorithm. Even though 
the guarantees we will prove in the next section are substantially better, we will 
use most of the ideas presented here. Next we define the neighborhood of a 
demand point k € T>. 

Definition 1. If{x,y) is a feasible solution to the linear programming relaxation 
P, and j € V is any demand point, the neighborhood of j, N(j), is the set of 
facilities that fractionally service j, that is, N(j) = {z G iF : Xij > 0}. 
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The following definition was crucial for the algorithm of Shmoys, Tardos, and 
Aardal [11]. 

Definition 2. Suppose that (x,y) is a feasible solution to the linear program- 
ming relaxation P and let gj > 0, for each j G V. Then (x, y) is g-close ifxij > 0 
implies that Cij < gj (j GV,iG T). 

Notice that if (x, y) is g-close and j G T> is any demand point, j is fractionally 
serviced by facilities inside the ball of radius gj centered at j. The following 
lemma is from [1 1] . 

Lemma 1. Given a feasible g-close solution (x,y), we can find, in polynomial 
time, a feasible integer 3g-close solution (x,y) such that 

■ 

We briefiy sketch the proof below. The algorithm can be divided into two 
steps: a clustering step and a facility opening step. The clustering step works as 
follows (see Table 1). Let S be the set of demand points that have not yet been 
assigned to any cluster; initially, S = T>. Find the unassigned demand point 
jo with smallest gj-vahie and create a new cluster centered at jo- Then all of 
the unassigned demand points that are fractionally serviced by facilities in the 
neighborhood of jo (that is, all the demand points k G S with N(fc) n N(jo) 0) 
are assigned to the cluster centered at jo] the set S is updated accordingly. 
Repeat the procedure until all the demand points are assigned to some cluster 
(i.e., S = 0). We will use C to denote the set of centers of the clusters. 



Table 1. The clustering construction of Shmoys, Tardos, and Aardal. 



l.S<-V, C^0 

2. while 5 yf 0 

3. choose jo G S with smallest gj value (j G S) 

4. create a new cluster Q centered at jo, C ^ CU {jo} 

5. Q^{fcG5:N(fc)nN(jo)y^0| 

6. S^S-Q 



The following fact follows easily from the clustering construction and the 
definition of neighborhood, and is essential for the success of the algorithm. 

Fact 1. Suppose that we run the clustering algorithm of Table 1, using any g- 
close solution (x,y), then: 

(a) neighborhoods of distinct centers are disjoint (i.e., if j and k are centers, 
j^kGC, then N(j) n N(fc) = %), 

(b) for every demand point k G T), X)i6N(fe) ~ ^ ■ 
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After the clustering step, the algorithm of [11] opens exactly one facility per 
cluster. For each center j S C we open the facility io in the neighborhood of j, 
N(j), with smallest fixed cost fi and assign all the demand points in the cluster 
of j to facility io. Observe that by inequalities (2), /u ^ 

Si6N(i) fiVi- Using Fact 1(a), the total facility cost incurred by the algorithm 
is never more than the total fractional facility cost fiVi- Next consider 

any demand point k € T) and suppose it belongs to the cluster centered at jo', 
let £ € N(/c) n N(jo) be a common neighbor and let i be the open facility in 
the neighborhood of jo (see Figure 1). Then, the distance from k to i can be 




Fig. 1. Bounding the service cost of k (4-approximation algorithm). The circles 
(•) are demand points, whereas the squares (■) are facility locations. 



bounded by the distance from k to £ (which is at most gu) plus the distance 
from £ to jo (which is at most plus the distance from jo to i (which is at 
most gj„). Thus, the distance from k to an opened facility is at most 2gj^ + g^, 
which is at most 3gk, since jo was the remaining demand point with minimum 
value. Hence the total service cost can be bounded by 3 9k- 

Shmoys, Tardos, and Aardal used the filtering technique of Lin & Vitter 
[8] to obtain g-close solutions and then applied Lemma 1 to obtain the first 
constant factor approximation algorithms for the problem. However, a simpler 
(/-close solution is directly obtained by using the optimal solution to the dual 
linear program of P. More precisely, the dual of the linear program P is given by 
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Max 


51 




(4) 




jev 






(D) subject to 




for each z G IF 


(5) 




jdV 








Vj ^ij — ^ij 


for each i G if, j G T> 


( 6 ) 




Wij > 0 


for each i G if, j G T> . 


(7) 



Fix an optimal primal solution (x*, y*) and an optimal dual solution (v* , w*), 
and let LP* be the optimal LP value. Complementary slackness gives that x*j > 0 
implies that v* — w*j = Cij ; since w*j > 0, we get the following lemma. 

Lemma 2. If{x*,y*) is an optimal solution to the primal linear program P and 
(v*,w*) is an optimal solution to the dual linear program D, then (x*,y*) is 
V* -close. 

By applying Lemma 1 to the optimal u*-close solution (x*,y*), we obtain a 
feasible solution for the problem with total facility cost at most fiVi 

with total service cost bounded by = 3 LP*. We can bound the sum 

of these by 4 LP*; thus we have a 4-approximation algorithm. Note the inbalance 
in bounding facility and service costs. 



3 A Randomized Algorithm 

After solving the linear program P, a very simple randomized algorithm is the 
following: open a facility at location i € T with probability y* independently 
for every i £ T, and then assign each demand point to its closest open facility. 
Notice that the expected facility cost is just fiVh the same bound as in the 

algorithm of Section 2. Focus on a demand point k € T>. If it happens that one 
of its neighbors has been opened, then the service cost of k would be bounded 
by the optimal dual variable v^. However, if we are unlucky and this is not 
the case (an event that can easily be shown to occur with probability at most 
1/e « 0.368, where the bound is tight), the service cost of k could be very large. 
On the other hand, suppose that we knew, for instance, that for the clustering 
computed in Section 2, k belongs to a cluster centered at j, where one of the 
facilities in N(j) has been opened. Then in this unlucky case we could bound the 
service cost of k using the routing cost of the 4-approximation algorithm. 

Our algorithm is also based on randomized rounding and the expected facility 
cost is fiV* ■ However, we weaken the randomized rounding step and do not 

open facilities independently with probability y*, but rather in a dependent way 
to ensure that each cluster center has one of its neighboring facilities opened. 

Even though the algorithms presented in this section work for any g-close 
feasible solution, for sake of simplicity we will assume as in the end of Section 2, 
that we have a fixed optimal primal solution {x*,y*) and a fixed optimal dual 
solution (v*,w*), so that (x*,y*) is u*-close. It is easy to see that we can assume 
that y* < 1 for each potential facility location z S IF. To motivate the following 
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definition, fix a demand location j G T>, and suppose without loss of generality 
that the neighbors of j (that is, the facilities i for which x*j > 0) are {1, . . . ,d} 
with cij < C 2 j < . . . < Cdj ■ Then it is clear that we can assume without loss of 
generality that j is assigned “as much as possible” to facility 1, then to facility 2 
and so on; that is, = yl, ■ ■ ■ , = Vd-i (but maybe x*j_j < yd)- 

Definition 3. A feasible solution (x,y) to the linear programming relaxation P 
is complete ifxij > 0 implies that Xij = y^, for every i G if, j G T>. 

Thus the optimal solution (x*,y*) is “almost” complete, in the sense that for 
every j G T> there is at most one i G if with 0 < x*j < y* . 

Lemma 3. Suppose that (x,y) is a feasible solution to the linear program P for 
a given instance of the uncapacitated facility location problem X. Then we can 
find in polynomial time an equivalent instance I and a complete feasible solution 
(i, y) to its linear programming relaxation with the same fractional facility and 
service costs as {x,y), that is, fiVt = fiVt = 

Moreover, if \x,y) is g-close, so is (x,y). □ 

Proof. Pick any facility i G T for which there is a demand point j G T> with 

0 < Xij < iji (if there is no such a facility, the original solution (fc, y) is complete 

and we are done). Among the demand points j G T> for which Xij > 0, let jo 
be the one with smallest Xij value. Next create a new facility location i' which 
is an exact copy of i (i.e. same fixed cost and in the same location), and set 
Vi' = yi~Xij„ and reset iji equal to Xij^ . Next for every j G T> with Xij > 0, reset 
Xij = Xij^ = iji, and set xuj = Xij — Xij^ (which is nonnegative by the choice 
of Jo). All the other components of a; and y remain unchanged. Clearly (x,y) is 
a feasible solution to the linear programming relaxation of the new instance; if 
(x,y) is (/-close so is {x,y). It is straightforward to verify that the new instance 
is equivalent to the old one (because there are no capacity restrictions) and that 
the fractional facility and service costs of the solutions {x, y) and (i, y) are the 
same. Since the number of pairs (fc,j) for which 0 < Xkj < y^ has decreased 
at least by one, and initially there can be at most iT’lllCj < such pairs, 
iterations suffice to construct a new instance with a complete solution. □ 

By Lemma 3, we can assume that {x* , y*) is complete. To understand some of 
the crucial points of our improved algorithm we will first consider the following 
RANDOMIZED ROUNDING WITH CLUSTERING. Suppose that we run the clustering 
procedure exactly as in Table 1, and let C be the set of cluster centers. We 
partition the facility locations into two classes, according to whether they are in 
the neighborhood of a cluster center or not. 

Definition 4. The set o/ central facility locations C is the set of facility locations 
that are in the neighborhood of some cluster center, that is, C = UjgcN(j); the 
remaining set of facility locations TZ = T — L are noncentral facility locations. 

The algorithm opens facilities in a slightly more complicated way that the 
simplest randomized rounding algorithm described in the beginning of the sec- 
tion. First we open exactly one central facility per cluster as follows: indepen- 
dently for each center j G C, open neighboring facility i G N(j) at random with 
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probability x*j (recall Fact 1(b)). Next independently we open each noncen- 
tral facility i £ TZ with probability y*. The algorithm then simply assigns each 
demand point to its closest open facility. 

Lemma 4. For each facility location i ^ T , the probability that a facility at 
location i is open is y*. 

Proof. If t is a noncentral facility (z S TZ), we open a facility at z with probability 
y*. Suppose next that z is a central facility {i G £), and assume that z G N(j), 
for a center j G C. A facility will be opened at location z only if the center j 
chooses it with probability x*j] but x*j = y* since (x*,y*) is complete. □ 

Corollary 2. The expected total facility cost is fiy* ■ 

For each demand point k G T>, let Ck denote the fractional service cost of k, 
that is, Ck = CikX*k- The expected service cost of fc G I? is bounded in the 

following lemma whose proof is presented below. 

Lemma 5. For each demand point k G V, the expected service cost of k is at 
most Ck + {3/e)vl. 

Overall, since expected total service cost can be bounded 

as follows. 

Corollary 3. The expected total service cost is at most Cfe -I- (3/e) LP*. 

By combining Corollaries 2 and 3, and noting that fiyt = 

LP* , we obtain the following. 

Theorem 2. The expected total cost incurred by randomized rounding with 
CLUSTERING is at most (1 -I- 3/e)LP*. 

Proof of Lemma 5. Fix a demand point k GT>. For future reference, let jo be the 
center of the cluster to which k belongs; notice that jo always has a neighboring 
facility Zo (zo G N(jo)) opened, hence its service cost is never greater than v*^. 
To gain some intuition behind the analysis, suppose first that each center in C 
shares at most one neighbor with k; that is, |N(/) n N(/c)| < 1, for each center 
j G C. Each neighbor z G N(/c) is opened with probability y* = x*f. independently 
in this special case. For notational simplicity suppose that N(fc) = {1,... ,d}, 
with cik < . . . < Cdk- Let q be the probability that none of the facilities in N(/c) 
is open. Note that q = rii=i(l ~ vt) = rii=i(l ~ ^*k)- observation is 

that q is “not too big”: since 1 — a; < (a; > 0), using Fact 1(b), 

d d / 

9 = 11(1 - x*k) < n = exp I - ^ x*k 

i=l i=l \ i=l 

where exp (a;) = e“. We will bound the expected service cost of k by considering 
a clearly worse algorithm: assign k to its closest open neighbor; if none of the 
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neighbors of k is open, assign k to the open facility io G N(jo) (exactly as in 
Section 2). If facility 1 is open, an event which occurs with probability y^, the 
service cost of k is cik- If, on the other hand, facility 1 is closed, but facility 2 
is open, an event which occurs with probability (1 — y*)y 2 , the service cost of k 
is C 2 fe, and so on. If all the facilities in the neighborhood of k are closed, which 
occurs with probability q, then k is assigned to the open facility io G N(jo). But 
in this case, k is serviced by io, so the service cost of k is at most 2v*^ +^1 < 
exactly as in Figure 1 (Section 2); in fact, this backup routing gives a deterministic 
bound: the service cost of k is always no more than 3f^. Thus the expected 
service cost of k is at most 

cikVl + C 2 fc 1 / 2(1 - 2 / 1 ) H ^ Cdfc y*d{l - 1 / 1 ) ... (1 - yl_i) + ivlq 

< I] + —iivl = Ck+—vl , 
e e 

which concludes the proof of the lemma in this special case. 

Now we return to the more general case in which there are centers in C that 
can share more than one neighbor with k. We assumed that this was not the case 
in order to ensure that the events of opening facilities in N(/c) were independent, 
but now this is no longer true for facilities i,i' G N(fc) that are neighbors of the 
same center. However, if one of i or i' is closed, the probability that the other 
is open increases; thus the dependencies are favorable for the analysis. The key 
idea of the proof is to group together those facilities that are neighbors of the 
same cluster center, so that the independence is retained and the proof of the 
special case above still works. A more rigorous analysis follows. 

Let C be the subset of centers that share neighbors with k. For each center 
j G C, let Sj = N(j) n N(fc), and so C = {j G C : Sj yf 0}. We have already proved 
the lemma when jAyl < 1, for each center j G C. For each center j G C, let Ej be 
the event that at least one common neighbor of j and k is open (see Figure 2). 
To follow the proof, for each j G C, it will be convenient to think of the event 
of choosing facility i in Sj as a sequence of two events: first j chooses to “open” 
Sj with probability pj = ^tk event Ej occurs); and then if Sj is 

open, j chooses facility i G Sj with probability x*jjpj (which is the conditional 
probability of opening i given event Ej). Now let Cj = ^ik'^XklVj', that is, 

Cj is the conditional expected distance from k to Sj, given the event Ej. For 
example, if Sj = {r, s,t} are the common neighbors of j and k, the event Ej 
occurs when one of r,s or t is open, pj = + x^^, and Cj = CrkX*j./pj + 

^skX*i,/pj + ctkxlf./pj. Notice that by Fact 1(a), the events Ej {j G C) are 
independent. This completes the facility grouping. Consider the neighbors of k 
that are noncentral facility locations; that is, locations i G N(fc) n 7^. For each 
each noncentral neighbor i G N(fc) n 7^, let Ei be the event in which facility i is 
open, Ci be the distance Cik, and pi = x*^. Next notice that all of the events Ei 
are independent. It follows easily from the definitions that '^iPi = 1 

and Cepe = C'k . 
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Now we can argue essentially as in the simple case when jS'jl < 1 for each 
center j G C. Assume that there are d events Ei, and for notational simplicity, 
they are indexed by ^ G { 1 , . . . , d}, with Ci < . . . < Cd- Let D be the event 
that none of Ei, . . . ,Ed occurs; that is, D is precisely the event in which all 
the facilities in the neighborhood of A:, N(fc), are closed; let q be the probability 
of event D. Note that, as in the simple case, the service cost of k is never 
greater that its backup routing cost 3 u^, in particular, this bound holds even 
conditioned on D. As before, we will analyze the expected service cost of a worse 
algorithm: k is assigned to the open neighboring facility with smallest Cf, and if 
all the neighbors are closed, k is assigned through its backup routing to the open 
facility io G N(jo). If the event E\ occurs (with probability pi), the expected 




Fig. 2. Estimating the expected service cost of k. Here the centers that share a 
neighbor with k are demand locations 2 and 4 (Cfc = {2,4}). The neighbors of 
k that are noncentral locations are 1 and 3 . Event E2 (respectively E4) occurs 
when a facility in N(fc) n N(2) (respectively N(fc) n N(4)) is open, while event Ei 
(respectively E3) occurs when facility 1 (respectively 3 ) is open. Though there 
are dependencies among the neighbors of a fixed center, the events Ei, E2, E3 
and E4 are independent. 



service cost of A: is Ci. If event Ei does not occur, but event E2 occurs (which 
happens with probability (1 — pi)p2), the expected service cost of k is C2, and 





Approximation Algorithms for Uncapacitated Facility Location 191 



SO on. If we are in the complementary space D, which occurs with probability 
q = rit=i(^ ~ Pt), the service cost of k is never greater than its backup service 
cost Svl- Thus the expected service cost of k can be bounded by 

Cipi + C 2 (1 -pi)p2 H \-Cd{l-pi)---{l-Pd-i)Pd + 3vlq . (8) 

To prove the lemma we bound the first d terms of (8) by Ck, and g by 1/e. □ 



Notice than even though the clustering construction is deterministic, the 
backup service cost of k (that is, the distance between k and the facility open 
in N(jo)) is a random variable B. In the proof above, we used the upper bound 
B < 3u/. In fact, the proof of Lemma 5 shows that the expected service cost of 
k is no more than Ck + q As it can be easily seen, the upper bound used 

for equation (8) is not tight. In fact, we can get an upper bound of (1 — g) Cfc + 
gE[i?|I?] as follows. First note the following simple probabilistic interpretation 
of the first d terms of (8). Let Zg {i = 1,... ,d) be independent 0-1 random 
variables with ProbjZ^ = 1} = pi. Consider the set of indices for which is 1, 
and let Z be the minimum Ci value in this set of indices; if all of the Zg are 0, Z is 
defined to be 0. Then the expected value of Z is exactly equal to the first d terms 
of (8). Given a set of numbers S, we will use miuo (S') to denote the smallest 
element of S if S is nonempty, and 0 if S is empty, so that Z = mino{^^=i} CiZ^. 
The following intuitive probability lemma provides a bound on the first d terms 
of (8). 



Lemma 6. Suppose that 0 < Ci < . . . < Cd, Pi, ■ ■ ■ ,Pd > 0, with = 1- 

Let Zi,... ,Zd be 0-1 independent random variables, with ProbjZ^ = 1} = pg. 
Let C = Y^ C'epi. Then 

r d 1 



E 



minoCiZi U(^-Z,)C 

‘ 1=1 



< c . 



□ 



Applying the lemma to the first d terms of equation (8), since E[]/[^^^(l — Z^)] = 
Yle=ii^ ~ Pt) — 1, we have that 



Cipi + C2(l - pi)p2 H h Cd(l - pi) ... (1 - Pd-i)Pd < Cfc(l - g) . ( 9 ) 



Thus we have proved the following. 

Lemma 7. For each demand point k € T), the expected service cost of k is at 
most (1 — g) Cfe -I- g E[B|D]. 

Finally we introduce the last idea that leads to the (l-|-2/e)-approximation al- 
gorithm. In Figure I, we have bounded the distance from center jo to the open fa- 
cility i, Cij^, by v*^. However, i is selected (by the center jo) with probability x*j^ 
and, thus, the expected length of this leg of the routing is Cij^xL^ = Cj^, 

which in general is smaller than the estimate v*^ used in the proof of Lemma 5. 
Thus, to improve our bounds, we slightly modify the clustering procedure by 
changing line 3 of the procedure of Table 1 to 




192 



Fabian A. Chudak 



3’. choose jo G S with smallest v* + Cj value (j G S) 



We will call the modified algorithm randomized rounding with improved 
CLUSTERING. Notice that Lemmas 4 and 7 are unaffected by this change. We will 
show that the modified rule 3’ leads to the bound E[i?|D] < 2v’^ + Ck, improving 
on the bound of we used in the proof of Lemma 5. 

Lemma 8. If we run randomized rounding with improved clustering, 
the conditional expected backup service cost of k, E[B|Z9], is at most + Ck- 

Proof. Suppose that the clustering partition assigned k to the cluster with center 
jo- Deterministically, we divide the proof into two cases. 

Case 1. Suppose that there is a facility ^ G N(fc) n N(jo), such that 
(see Figure 3(a)). Let i be the facility in N(jo) that was opened by jo] notice 
that Cij^ < v*^ (because (x,y) is u*-close). Then the service cost of k is at most 
Cik < Cki + Cij^ + Cj^i, which using again that (x*,y*) is u*-close, is at most 
< Pfc + — C'fc + where the last inequality follows 

from the fact that the center has the minimum (Cj + Pj) value. In this case, we 
have a (deterministic) bound, B <Ck + 2u^. 

Case 2. Assume that > Cj^ for every i G N(fc) n N(jo) (see Figure 
3(b)). First note that when we do not condition on D (i.e., that no facility 
in N(fc) is open), then the expected length of the edge from jo to the facility 
that jo has selected is Cj^. However, we are given that all the facilities in the 
neighborhood of k are closed, but in this case, all of these facilities that contribute 
to the expected service cost of jo (the facilities in N(fc) n N(jo)) are at distance 
greater than the average Cj^ . Thus the conditional expected service cost of jo 
is at most the unconditional expected service cost of jo, Cj^. It follows then 
that if ^ G N(fc) n N(jo), the conditional expected service cost of k is at most 
Cjo 'P ^ “I” '^ja T Hence, E[H|Z?] < Ck + 2u^ in this 

case too. □ 

Thus, using Lemmas 7 and 8, the expected service cost of k can be bounded by 

C'fe(l — (?) + 9 (2Pfc + C'fc) = Ck ~\~2qVk < Cfc H Vk , 

e 

where once again we bound q by 1/e. 

Corollary 4. The expected total service cost of randomized rounding with 
improved clustering is at most J 2 kev + (2/e) J 2 kev ■ 

Combining Corollaries 2 and 4, randomized rounding with improved 
CLUSTERING produces a feasible solution with expected cost no greater than 
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Fig. 3. Bounding the backup service cost of k. 



Thus we have proved the following theorem. 



Theorem 3. There is a polynomial-time randomized algorithm that finds a fea- 
sible solution to the uncapacitated facility location problem with expected cost at 
most (1 + 2/e)LP*. 

To finish the proof of Theorem 1 we can show that the algorithm of Theorem 3 
can be derandomized using the standard method of conditional expectations. 

4 Discussion 

We conclude with a few remarks concerning the algorithm of Section 3. A stan- 
dard technique to improve the guarantees of randomized rounding is to use a 
fixed parameter, say 7 > 0, to boost the probabilities. For instance, the sim- 
plest randomized rounding algorithm would open facility i & !F with probability 
minjy?/*, 1}. This technique can be also applied to our randomized algorithm 
in a very simple fashion. The bounds of Theorem 3 are thus parameterized for 
each 7 . Even though this does not lead to an overall improvement in the perfor- 
mance guarantee, it allows us to improve the performance guarantee for some 
values of p, where p G [0,1] is defined by pLP* = '^i^j^fiy*- In particular, 
we can show that \i p <2je ~ 0.736, there is a variant of our algorithm with 
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performance guarantee pln(2/p) + 1. This bound gets better as p approaches 
0. Another algorithm is obtained by applying Theorem 3 to the filtering g-close 
solution proposed by Shmoys, Tardos, and Aardal. This new algorithm performs 
better when p approaches 1 . Thus for “most” values of p, we can provide a better 
guarantee than the one of Theorem 1. For example, the worst possible guarantee 

I. 736 is achieved only when p is “close” to 0.70. 

Another interesting issue concerns the proof of Theorem 3. In particular, 
for the bound q < 1/e to be tight, it requires having many terms {d has to be 
relatively large) and all the x values have to be the same. This bad instance 
seems very unlikely to occur. Therefore for a particular solution, a more careful 
bookkeeping of the constants can provide a substantially improved guarantee. 

Finally, our work also indicates that it might be worthwhile to investigate 
the of randomization for more complex location problems. 
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Abstract. We consider the traveling salesman problem when the cities 
are points in R'* for some fixed d and distances are computed according 
to a polyhedral norm. We show that for any such norm, the problem of 
finding a tour of maximum length can be solved in polynomial time. If 
arithmetic operations are assumed to take unit time, our algorithms run 
in time 0(n^~^ logn), where / is the number of facets of the polyhedron 
determining the polyhedral norm. Thus for example we have 0{n^ logoi') 
algorithms for the cases of points in the plane under the Rectilinear and 
Sup norms. This is in contrast to the fact that finding a minimum length 
tour in each case is NP-hard. 



1 Introduction 

In the Traveling Salesman Problem (TSP), the input consists of a set C of cities 
together with the distances d{c, c') between every pair of distinct cities c, c' G C. 
The goal is to find an ordering or tour of the cities that minimizes (Minimum 
TSP) or maximizes (Maximum TSP) the total tour length. Here the length of a 

tour C7r(l) J 5 ■ ■ ■ 5 ^TT{n) 

n— 1 

^ ^ ^(d-7r(i) 5 ) “b ^(d7r(n) j ^7t{1) ) ■ 

i=l 
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Of particular interest are geometric instances of the TSP, in which cities 
correspond to points in for some d>l, and distances are computed according 
to some geometric norm. Perhaps the most popular norms are the Rectilinear, 
Euclidean, and Sup norms. These are examples of what is known as an 
norm” for p = 1, 2, and oo. In general, the distance between two points x = 
{xi,X2, ■■■,Xd) and y = (yi, ?/ 2 , ■ ■ ■ ,Vd) under the U norm, p > 1, is 

c^(x,y) = ( X! 1^* “ 

\i=l 

with the natural asymptotic interpretation that distance under the L°° norm is 

d(x, y) = \x^ - yi\ . 



This paper concentrates on a second class of norms which also includes the 
Rectilinear and Sup norms, but can only approximate the Euclidean and other 
norms. This is the class of polyhedral norms. Each polyhedral norm is determined 
by a unit ball which is a centrally-symmetric polyhedron P with the origin at its 
center. To determine d(x, y) under such a norm, first translate the space so that 
one of the points, say x, is at the origin. Then determine the unique factor a by 
which one must rescale P (expanding if a > 1, shrinking if a < 1) so that the 
other point (y) is on the boundary of the polyhedron. We then have d(x, y) = a. 

Alternatively, and more usefully for our purposes, we can view a polyhedral 
norm as follows. If P is a polyhedron as described above and has F facets, then 
F is divisible by 2 and there is a set i/p = {hi , . . . ,hp/ 2 } of points in R'^ such 
that P is the intersection of a collection of half-spaces determined by Hp : 



> F/2 



' F/2 



P = ( n |x : X • hj < 1} j n [ Pi |x : x • h* > -1} 

Then we have 



. i=l 



, i=l 



d(x, y) = max | (x - y) • h^ : 1 < z < F/2^ 



Note that for the Rectilinear norm in the plane we can take iJp = {(1, 1), (—1, 1)} 
and for the Sup norm in the plane we can take Flp = {(1, 0), (0, 1)}. 

For the Minimum TSP on geometric instances, all of the key complexity 
questions have been answered. As follows from results of Itai, Papadimitriou, 
and Swarcfiter [5], the Minimum TSP is NP-hard for any fixed dimension d and 
any or polyhedral norm. On the other hand, recent results of Arora [1,2] and 
Mitchell [8] imply that in all these cases a polynomial-time approximation scheme 
(PTAS) exists, i.e., a sequence of polynomial-time algorithms 1 < fc < oo, 
where Ak is guaranteed to find a tour whose length is within a ratio of 1 -I- (1/ A:) 
of optimal. 

The situation for geometric versions of the Maximum TSP is less completely 
resolved. Barvinok [3] has shown that once again polynomial-time approximation 




The Maximum TSP 



197 



schemes exist for all fixed dimensions d and all or polyhedral norms (and in 
a sense for any fixed norm; see [3]). Until now, however, the complexity of the 
optimization problems themselves when d is fixed has remained open: For no 
fixed dimension d and or polyhedral norm was the problem of determining 
the maximum tour length known either to be NP-hard or to be polynomial-time 
solvable. In this paper, we resolve the question for polyhedral norms, showing 
that, in contrast to the case for the Minimum TSP, the Maximum TSP is solvable 
in polynomial time for any fixed dimension d and any polyhedral norm: 

Theorem 1. Let dimension d be fixed, and let || • || be a fixed polyhedral norm in 

whose unit ball is a polyhedron P determined by a set of f facets. Then for any 
set of n points in one can construct a traveling salesman tour of maximum 
length with respect to || • || in time logn), assuming arithmetic operations 

take unit time. 

As an immediate consequence of Theorem 1, we get relatively efficient algo- 
rithms for the Maximum TSP in the plane under Rectilinear and Sup norms: 

Corollary 2. The Maximum TSP for points in R^ under the and L°° norms 
can be solved in 0{n^ logn) time, assuming arithmetic operations take unit time. 

The restriction to unit cost arithmetic operations in Theorem 1 and Corol- 
lary 2 is made primarily to simplify the statements of the conclusions, although 
it does refiect that fact that our results hold for the real number RAM compu- 
tational model. Suppose on the other hand that one assumes, as one typically 
must for complexity theory results, that the components of the vectors in i/p 
and the coordinates of the cities are all rationals. Let U denote the maximum 
absolute value of any of the corresponding numerators and denominators. Then 
the conclusions of the Theorem and Corollary hold with running times multi- 
plied by nlog{U). If the components/coordinates are all integers with maximum 
absolute value U, the running times need only be multiplied by log(nU). For 
simplicity in the remainder of this paper, we shall stick to the model in which 
numbers can be arbitrary reals and arithmetic operations take unit time. The 
reader should have no trouble deriving the above variants. 

The paper is organized as follows. Section 2 introduces a new special case 
of the TSP, the Tunneling TSP, and shows how the Maximum TSP under a 
polyhedral norm can be reduced to the Tunneling TSP with the same number of 
cities and //2 tunnels. Section 3 sketches how the latter problem can be solved 
in time, a slightly weaker result than that claimed in Theorem 1. The 

details of how to improve this running time to log n) will be presented 

in the full version of this paper, a draft of which is available from the authors. 
Section 4 concludes by describing some related results and open problems. 

2 The Tunneling TSP 

The Tunneling TSP is a special case of the Maximum TSP in which distances 
are determined by what we shall call a tunnel system distance function. In such a 
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distance function we are given a set T = ■■■ ,tk} of auxiliary objects that 

we shall call tunnels. Each tunnel is viewed as a bidirectional passage having 
a front and a back end. For each pair c, t of a city and a tunnel we are given 
real-valued access distances F{c,t) and B{c,t) from the city to the front and 
back ends of the tunnel respectively. Each potential tour edge {c, c'} must pass 
through some tunnel t, either by entering the front end and leaving the back (for 
a distance of F{c, t) + B{c', t)), or by entering the back end and leaving the front 
(for a distance of B{c, t) + F{c' , t)). Since we are looking for a tour of maximum 
length, we can thus define the distance between cities c and c' to be 

d(c, c') = max |e(c, U) + B{d , U), B{c, ti) + F(c', U) : 1 < i < 



Note that this distance function, like our geometric norms, is symmetric. 

It is easy to see that Maximum TSP remains NP-hard when distances are 
determined by arbitrary tunnel system distance functions. However, for the case 
where fc = |T| is fixed and not part of the input, we will show in the next section 
that Maximum TSP can be solved in 0(n^^“^) time. We are interested in this 
special case because of the following lemma. 

Lemma 3. If || • || zs a polyhedral norm determined by a set Hp of k vectors 
in then for any set C of points in one can in time 0{dk\C\) construct a 
tunnel system distance function with k tunnels that yields d(c,c') =|| c — c' || for 
all c, c' G C. 

Proof. The polyhedral distance between two cities c, c' G R*^ is 



c — c = max 



{ (c - c') 



hi 



: 1< z < fc 



} 



= max |(c — c') • hi, (c' — c) • hi : 1 < z < 



Thus we can view the distance function determined by || • || as a tunnel system 
distance function with set of tunnels T = Hp and E(c, h) = c • h, B{c, h) = — c • h 
for all cities c and tunnels h. □ 



3 An Algorithm for Bounded Tunnel Systems 

This section is devoted to the proof of the following lemma, which together with 
Lemma 3 implies that the Maximum TSP problem for a fixed polyhedral norm 
with / facets can be solved in time. 

Lemma 4. If the number of tunnels is fixed at k, the Tunneling TSP can be 
solved in time assuming arithmetic operations take unit time. 

Proof. Suppose we are given an instance of the Tunneling TSP with sets C = 
{ci,...,c„} and T = {ti,...,tk} of cities and tunnels, and access distances 
F{c,t), B{c,t) for all c G C and t £ T. We begin by transforming the problem 
to one about subset construction. 
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Let G = {C LI T,E) be an edge-weighted, bipartite multigraph with four 
edges between each city c and tunnel t, denoted by ei[c,t,X], i e {1, 2} and X e 
The weights of these edges are w(ei[c, t,F]) = F{c,t) and w{ei[c, t,B]) = 
B{c,t), i e {1,2}. For notational convenience, let us partition the edges in F 
into sets F[t,F] = {ei[c,t,F] : c € C,i & {1)2}} and F[t,B] = {ei[c,t,B] : c € 
C,i & {1)2}}, t G T. Each tour for the TSP instance then corresponds to a 
subset F' of F that has equal to the tour length and satisfies 

(Tl) Every city is incident to exactly two edges in E' . 

(T2) For each tunnel t G T, \E' C\ E[t, F] \ = \E' C\ E[t, B]\. 

(T3) The set E' is connected. 

To construct the multiset E', we simply represent each tour edge {c, c'} by a 
pair of edges from E that connect in the appropriate way to the tunnel that 
determines d{c,c'). For example, if d{c,c') = F{c,t) + and c appears 

immediately before c' when the tour is traversed starting from then the 

edge (c, c') can be represented by the two edges e 2 [c,t,F] and ei[c',t,B], Note 
that there are enough (city,tunnel) edges of each type so that all tour edges can 
be represented, even if a given city uses the same tunnel endpoint for both its 
tour edges. Also note that if d(c,c') can be realized in more than one way, the 
multiset E' will not be unique. However, any multiset E' constructed in this 
fashion will still have Xees' equal to the tour length. 

On the other hand, any set E' satisfying (Tl) ~ (T3) corresponds to one (or 
more) tours having length at least Xees' w{e): Let T' G T be the set of tunnels t 
with \E' LE[t,F]\ > 0. Then G' = {CL T', E') is a connected graph all of whose 
vertex degrees are even by (Tl) - (T3). By an easy result from graph theory, 
this means that G' contains an Euler tour that by (Tl) passes through each 
city exactly once, thus inducing a TSP tour for C. Moreover, by (T2) one can 
construct such an Euler tour with the additional property that if ei[c,t,x] and 
6j [d , t, y] are consecutive edges in this tour, then x ^ y, i.e, either x = F,y = B 
or X = B,y = F. Thus we will have w(ei[c,t,x]) -I- w{ej[c' ,t,y]) < d{c,c'), and 
hence the length of the TSP tour will be at least Xees' ^(s)) claimed. 

Thus our problem is reduced to finding a maximum weight set of edges E' C 
E satisfying (Tl) - (T3). We will now sketch our approach to solving this latter 
problem; full details are available from the authors and will appear in the journal 
version of this paper. The basic idea is to divide into 0{rd^~^) subproblems, each 
of which can be solved in linear time. Each subcase corresponds to a choice of 
a degree sequence d for the tunnels, for which there are 0{rd~^) possibilities, 
and a choice, for that degree sequence, of a canonical-form sequence s of edges 
that connects together those tunnels that have positive degree, for which there 
are again 0{rd~^) possibilities. 

Having chosen d and s, what we are left with is a maximum weight bipartite 
6-matching problem: starting with a set consisting of the edges specified by s, 
each tunnel end must have its degree augmented up to that specified by d and 
each city must have its degree augmented up to 2. This 6-matching problem 
can be solved in 0{rd) time by the standard technique that converts it to an 
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assignment problem on an expanded graph. The overall running time for the 
algorithm is thus as claimed. □ 

In the full paper we show how two additional ideas enable us to reduce our 
running times to log n), as needed for the proof of Theorem 1. The 

first idea is to view each 6-matching problem as a transportation problem with 
a bounded number of customer locations. This latter problem can be solved 
in linear time by combining ideas from [7,4,11]. The second idea is to exploit 
the similarities between the transportation instances we need to solve. Here a 
standard concavity result implies that one dimension of our search over degree 
sequences can be handled by a binary search. In the full paper we also discuss 
how the constants involved in our algorithms grow with k. 

4 Conclusion 

We have derived a polynomial time algorithm for the Maximum TSP when the 
cities are points in for some fixed d and when the distances are measured 
according to some polyhedral norm. The complexity of the Maximum TSP with 
Euclidean distances and fixed d remains unsettled, however, even for d = 2. 
Although the Euclidean norm can be approximated arbitrarily closely by poly- 
hedral norms (and hence Barvinok’s result [3] that Maximum TSP has a PTAS), 
it is not itself a polyhedral norm. 

A further difficulty with the Euclidean norm (one shared by both the Min- 
imum and Maximum TSP) is that we still do not know whether the TSP is in 
NP under this norm. Even if all city coordinates are rationals, we do not know 
how to compare a tour length to a given rational target in less than exponential 
time. Such a comparison would appear to require us to evaluate a sum of n 
square roots to some precision, and currently the best upper bound known on 
the number of bits of precision needed to insure a correct answer remains ex- 
ponential in n. Thus even if we were to produce an algorithm for the Euclidean 
Maximum TSP that ran in polynomial time when arithmetic operations (and 
comparisons) take unit time, it might not run in polynomial time on a standard 
Turing machine. 

Another set of questions concerns the complexity of the Maximum TSP when 
d is not fixed. It is relatively easy to show that the problem is NP-hard for all 
LP norms (the most natural norms that are defined for all d > 0). For the case 
of L°° one can use a transformation from Hamiltonian Circuit in which each 
edge is represented by a separate dimension. For the norms, 1 < p < oo, 
one can use a transformation from the Hamiltonian Circuit problem for cubic 
graphs, with a dimension for each non-edge. However, this still leaves open the 
question of whether there might exist a PTAS for any such norm when d is not 
fixed. Trevisan [10] has shown that the Minimum TSP is Max SNP-hard for any 
such norm, and so cannot have such PTAS’s unless P = NP. We can obtain a 
similar result for the Maximum TSP under L°° by modifying our NP-hardness 
transformation so that the source problem is the Minimum TSP with all edge 
lengths in {1, 2}, a special case that was proved Max SNP-hard by Papadimitriou 
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and Yannakakis [9]. The question remains open for 1 < p < oo, although we 
conjecture that these cases are Max SNP-hard as well. 

Finally, we note that our results can be extended in several ways. For instance, 
one can get polynomial-time algorithms for asymmetric versions of the Maximum 
TSP in which distances are computed based on non-symmetric unit balls. Also, 
algorithmic approaches analogous to ours can be applied to geometric versions of 
other NP-hard maximization problems: For example, consider the Weighted 3- 
Dimensional Matching Problem that consists in partitioning a set of 3n elements 
into n triples of maximum total weight. The special case where the elements 
are points in and where the weight of a triple equals the perimeter of the 
corresponding triangle measured according to some fixed polyhedral norm can 
be solved in polynomial time. 

Acknowledgment. We thank Arie Tamir for helpful comments on a preliminary 
version of this paper, and in particular for pointing out that a speedup of O(n^) 
could be obtained by using the transportation problem results of [7] and [11]. 
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Abstract. Many chemical properties of benzenoid hydrocarbons can 
be understood in terms of the maximum number of mutually resonant 
hexagons, or Clar number, of the molecules. Hansen and Zheng (1994) 
formulated this problem as an integer program and conjectured, based on 
computational evidence, that solving the linear programming relaxation 
always yields integral solutions. We establish their conjecture by showing 
that the constraint matrices of these problems are unimodular. 

Previously, Hansen and Zheng (1992) showed that a certain minimum 
weight cut cover problem defined for benzenoids yields an upper bound 
for the Clar number and conjectured that equality always holds. We 
prove that strong duality holds by formulating a network flow problem 
that can be used to compute the Clar number. We show that our results 
extend to generalizations of the Clar number and cut cover problems 
defined for plane graphs that are bipartite and 2-connected. 



1 Introduction 

In this paper we study optimization problems defined for maps of graphs that 
are bipartite and 2-connected. These problems are generalizations of ones that 
arise in chemical graph theory, specifically those encountered in the analysis of 
benzenoid hydrocarbons. 

Benzenoid hydrocarbons are organic molecules composed of carbon and hy- 
drogen atoms organized into connected (hexagonal) benzene rings. The structure 
of such a molecule is usually represented by a benzenoid system, a bipartite and 
two-connected plane graph whose interior faces are all regular hexagons. Each 
node of a benzenoid system represents a carbon atom, and each edge corresponds 
to a single or double bond between a pair of carbon atoms. The hydrogen atoms 
are not explicitly represented since their location is immediately deduced from 
the configuration of the carbon atoms. 

R. E. Bixby, E. A. Boyd, and R. Z. Rfos-Mercado (Eds.): IPCO VI 
LNCS 1412, pp. 202-212, 1998. © Springer-Verlag Berlin Heidelberg 1998 
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Generally, there is more than one way to arrange n carbon atoms into h 
hexagonal rings where each arrangement has n — 2(h— 1) hydrogen atoms. The 
chemical formula C„H„_ 2 (ft_i) represents a series of benzenoid isomers, struc- 
turally distinct molecules having the same atomic makeup. However, not all 
possible arrangements correspond to actual molecules. For benzenoid molecules, 
empirical evidence [4,5] shows that their graphs always have a perfect matching. 

In chemical graph theory, perfect matchings are called Kekule structures. A 
benzenoid system will usually have many Kekule structures; analyzing them is 
of interest to chemists and gives rise to interesting combinatorial optimization 
problems [8]. For example, the Clar number of a benzenoid system is the opti- 
mal value of a particular maximization problem over its set of Kekule structures 
and is a key concept in the aromatic sextet theory developed by E. Clar [3] to 
explain benzenoid phenomenology. It has been observed that as the Clar number 
increases within an isomeric series of benzenoid hydrocarbons, (1) isomeric sta- 
bility increases, so chemical reactivity decreases, and (2) the absorption bands 
shift towards shorter wavelengths, so the isomer colors change from dark blue- 
green to red, yellow or white [5]. It has also been demonstrated that the Clar 
number provides a rough estimate of the Dewar-type resonance energy [1]. 

Hansen and Zheng [7] formulated the computation of the Clar number as an 
integer (linear) program and conjectured, based on empirical results, that solving 
its linear programming relaxation yields integer solutions for all benzenoids. 
We prove here that the constraint matrix of this integer program is always 
unimodular. In particular, this establishes that the relaxation polytope is integral 
since the linear program is in standard form. 

Interestingly, these unimodular constraint matrices coming from an applied 
problem constitute an unusual case: they are not, in general, totally unimodular 
as often occurs with optimization problems on graphs that give rise to integral 
polyhedra. However, for the Clar problem, we remark that a subset of them are 
(also) totally unimodular, namely those constraint matrices corresponding to a 
natural subset of benzenoids called catacondensed benzenoid hydrocarbons. 

In a previous paper, Hansen and Zheng [6] considered a minimization problem 
for benzenoid systems that we call here the minimum weight cut cover problem. 
They showed that the optimal value of the cover problem is an upper bound for 
the Clar number and conjectured that equality always holds. We prove this is the 
case by formulating the minimum weight cut cover problem as a network flow 
problem and using its dual to construct a solution for the Clar number problem. 
As a consequence, the Clar number and the minimum weight cut cover problems 
can be solved in strongly polynomial time using a minimum cost network flow 
algorithm. Our results are established for the generalized versions of the Clar 
number and cut cover problems that are defined here. 

2 Preliminaries and Background 

In this paper we consider combinatorial optimization problems defined for plane 
graphs that are bipartite and 2-connected. The undefined graph theory terminol- 
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ogy we use is standard and may be found in [2] or [9]. Here {V, E, F) will always 
denote the plane map with set of finite faces F that results from embedding a 
bipartite and 2-connected planar graph G = (H, E). We will further assume that 
the nodes of V are colored black or white so that all edges connect nodes of 
different color. 

Since G is bipartite and 2-connected, the boundary of each face f G F 
is an even cycle which can be perfectly matched in two different ways. Each 
of these two possible matchings is called a frame of the face. A frame of a 
face / is clockwise oriented if each frame edge is drawn from a black node to 
white node in a clockwise direction along the boundary of /. Otherwise, the 
frame is counterclockwise oriented. A framework for {V,E,F) is a mapping Lp : 
F I— >■ E such that <p{f) is a frame for each f G F. A framework tp is simple if 
fif) O 'p{f') = 0 whenever / yf f. An oriented framework has all frames with 
the same orientation. It follows that oriented frameworks are simple. 

A set of node-disjoint faces F' C F is called a resonant set for {V, E, F) 
if there exists a perfect matching for G that simultaneously contains a frame 
for each face in F'. We denote by res{V,E,F) the maximum cardinality of a 
resonant set for (V, E, F). The maximum resonant set problem seeks to determine 
res(V, E, F). It can be viewed as a set partitioning problem where nodes in a 
plane map must be covered by faces or edges, and the objective is to maximize 
the number of faces in the partition. Of course, this problem is infeasible if G 
does not have a perfect matching. 

When {V, E, F) is a benzenoid system, res(E, E, F) is called its Clar number. 
A Clar structure is a representation of an optimal solution to this problem where 
the (hexagonal) faces and edges in the partition have inscribed circles and double 
lines, respectively (see Fig. 1). 






Fig. 1. Clar structures of different benzenoids 



We recall that a basis of a matrix is a maximal linearly independent col- 
umn submatrix. An integer matrix is called unimodular if, for each basis, the 
determinants of the largest submatrices are relatively prime. A matrix is totally 
unimodular if the determinant of every square submatrix is 0, 1, or -1. Totally 
unimodular matrices are unimodular. The relationship between unimodular ma- 
trices and integral polyhedra is stated in a theorem established by Veinott and 
Dantzig [13] and extended by Truemper [12] to matrices that do not have full row 
rank. To simplify the presentation of results, we consider an empty polyhedron 
to be integral. 
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Theorem 1 ([13], [12]). An integer matrix A is unimodular if and only if the 
polyhedron {x : Ax = b, a; > 0} zs integral for any integer vector b. 

An integer matrix M is column eulerian if Ml = 0 (mod 2), i.e., if the sum 
of its columns is a vector with all entries even. Truemper gave the following 
sufficient condition for a matrix to be unimodular. 

Theorem 2 ([12]). An integer matrix A is unimodular if: 

1. A = BC, with B a unimodular basis of A and each entry of C is 0, 1 or -1. 

2. No column eulerian column submatrix of A is linearly independent. 

Throughout this paper we assume familiarity with basic properties of totally 
unimodular matrices as can be found in [10] or [11]. 



3 Two Useful Propositions 

We say two subgraphs of a plane graph are intrafacial if they are edge-disjoint 
and each one is contained in the closure of a face of the other. Let Gi and G 2 be 
two intrafacial subgraphs. We say Gi is interior to G 2 and G 2 is exterior to G\ 
if Gi is contained in the closure of an interior face of G 2 . We say Gi and G 2 are 
mutually exterior if each subgraph is contained in the exterior face of the other 
one. 

Proposition 3. A plane graph with all even degree nodes can be decomposed 
into pairwise intrafacial cycles. 

Proof. We outline here an iterative procedure that yields the desired decom- 
position. Note that the blocks of a plane graph with all even degree nodes are 
eulerian, 2-connected and pairwise intrafacial. Since a block is 2-connected, the 
boundary of each of its faces is a cycle. Removing such a cycle from the original 
graph results in a new graph whose blocks continue to be eulerian, 2-connected 
and pairwise intrafacial. Furthermore, the removed cycle is intrafacial with re- 
spect to the new graph. □ 

An (even) cycle is signed if each edge is alternatively assigned a value of 1 
or -1. The signing of a plane cycle is clockwise if edges drawn from a black to 
a white node in a clockwise direction along the cycle are given a value of 1; 
otherwise the signing is counterclockwise. For brevity in the following proof, we 
say a face and a cycle are frame adjacent if the face’s frame and the cycle have 
common edges. 

Proposition 4. Let G' = {V',E') be a plane subgraph of {V, E, F) that is de- 
composed into intrafacial cycles. Then the intrafacial cycles of G' can be signed 
such that, for any framework ip for {V, E, F) and for every f € F, all edges in 
(/?(/) n E' have the same sign. 
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Proof. All cycles referred to in this proof are those of the G' decomposition. 
Since G is bipartite, all cycles are signable. We will show that the following rule 
yields the desired signing: if a cycle lies in the interior of an even number of 
cycles (not counting itself), sign it clockwise; otherwise sign it counterclockwise. 

Consider any face f € F. Then / is interior or exterior to any cycle. Let 
n denote the total number of cycles that contain / and assume, without loss 
of generality, that ip{f) is clockwise oriented. If / is not frame adjacent to any 
cycle, the proposition holds vacuously for /. Next, we consider the cases when 
/ is interior or exterior to a frame adjacent cycle. 

Let c be a frame adjacent cycle that contains / in its interior. Then c must 
be the only such cycle and must also be the innermost cycle that contains / 
in its interior. Therefore, c must be interior to n — 1 cycles. For n odd (even), 
c is signed clockwise (counterclockwise). Since / is interior to c and <f{f) is 
clockwise oriented, its frame edges in c are drawn from a black to a white node 
in the clockwise direction along c. Hence, the frame edges in c are all assigned a 
+1 if n is odd and a -1 if n is even. 

Next, let C denote the set cycles exterior and frame adjacent to /. All cycles 
in C are signed with the same orientation since each must be interior to the 
same number of cycles as /. Since ip{f) is clockwise oriented, the frame edges 
that / shares with a cycle in C are drawn from a black node to white node in 
the counterclockwise direction along the cycle. Thus, the frame edges of / in the 
exterior cycles are assigned a +1 if n is odd and a -1 if n is even. 

In both cases, edges in ip{f) n E' are assigned a +I if n is odd and a -I if n 
is even. □ 

4 Formulations of the Maximum Resonant Set Problem 

The following integer program (IPl) is a formulation of the maximum resonant 
set problem that was proposed by Hansen and Zheng [7] to compute the Clar 
number of a benzenoid system. 

max : Kx + Ry =1, x > 0, y > 0, x € y € 



where 

K is the V X E node-edge incidence matrix of G = (V, E), and 
R is the V X E node-face incidence matrix of {V, E, F). 

To show that the linear programming relaxation of IPl yields an integral 
polytope, it is convenient to consider a reformulation of this integer program. 
Given a framework ip for (V, E, F), we define its edge-frame incidence matrix U 
to be an if X F matrix such that, for each face f G F, its corresponding column 
of U is the edge incidence vector of the frame <p{f). The incidence matrix of 
a framework yields the following useful factorization of the matrix R defined 
above. 
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Lemma 5. Let U be the incidence matrix of a framework. Then, R = KU . 

Introducing the vector of variables z = x + Uy yields an alternative formu- 
lation of the maximum resonant set problem (IP2). 

max {l^y : Kz = 1, a; -b Uy — z = 0, x > 0,y > 0, z > 0, y G , x,z G 2Z^}. 

All feasible solutions to the above integer program are necessarily binary 
vectors. Hence, constraints Kz = 1 express that z is the incidence vector of a 
perfect matching for G. Constraints x + Uy — z = Q partition the edges in the 
perfect matching represented by z between those in faces considered resonant 
(represented by Uy) and those identified by x. 

Note that this alternative formulation of the maximum resonant set problem 
is valid for the incidence matrix U of any framework of (V, E, F). To facilitate the 
subsequent analysis of this integer program, it is advantageous to assume that 
U is the incidence matrix of a simple framework. Recall this occurs whenever 
the framework is oriented. 



5 Unimodularity of the Constraint Matrices 

In this section we prove that the constraint matrices of both formulations of 
the maximum resonant set problem are unimodular. Therefore, their linear re- 
laxation polytopes are integral and the optimization problems can be solved 
directly as linear programs. 

First, we prove that the constraint matrix of the integer program IP 2 is 
unimodular when U is the incidence matrix of a simple framework. The general 
form of this matrix is 



0 0 K 
lU -I 



The incidence matrix of a simple framework has the following useful property. 

Lemma 6. Let U be the incidence matrix of a simple framework. Then, each 
row of U has at most one entry of 1. 

We are now ready for the main result of this section. 



Theorem 7. Let U be the incidence matrix of a simple framework (p. Then, the 
constraint matrix of the integer program LP2 is unimodular. 



Proof. Since unimodularity is preserved when rows and columns are multiplied 
by —1, it is equivalent to show that the following matrix is unimodular. 



A = 



0 0 K 
LU L 
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To show that matrix A satisfies the first condition of Theorem 2, it suffices 
to choose as basis B a column submatrix 



B = 



0 Kt 

1 Ix 



where Kt and It are the column submatrices of K and I, respectively, corre- 
sponding to the edges of a spanning tree of G. Since G is bipartite, K is totally 
unimodular. This implies that B is totally unimodular and, in particular, B is 
unimodular. It can easily be shown that if A is factored as A = BG, then all 
entries of G are 0, 1 or -1. 

Next, we show that the second condition of Theorem 2 is satisfied. Let A' be 
a column eulerian column submatrix of A. Then 



0 0 Ke' 

Id> Uf' Ie' 



where D' G E, E' G E and E' C F are edge and face subsets indexing the 
columns of A! . Since A is column eulerian and each row of U has at most one 
entry of 1, it follows that the nonzero rows of [Id' Up' Ie'] have exactly two 
entries of 1. To prove that A is linearly dependent, we must show there are 
vectors u € IR'° , v € , and w € M'® , not all zero, such that 



o 

o 




u 


\_Id' Uf' Ie’ \ 




V 






w 



= 0 



( 1 ) 



If E' is empty, we can set all entries of u equal to 1 and all entries of v equal 
to -1. Otherwise, let G' denote the plane subgraph of G = (V, E) induced by E' . 
Since Ke> is column eulerian, G' is a plane graph with all even degree nodes. By 
Proposition 3, we can assume that G' is decomposed into intrafacial cycles. Since 
these cycles are even and edge-disjoint, each cycle can be signed independently 
of the others. Note that in any signing, if we set We. equal to the value given to 
edge e in the cycle signing, for each e € E' , then Ke'W = 0. In particular, we 
choose to sign the cycles so that, for each face f G F, We = We' for all pairs 
of edges e and e' in (p{f) H E' . We showed in Proposition 4 that such a signing 
exists. 

To assign values to the entries of u and v we proceed as follows. For each 
/ G F', if ip{f) n if' yf 0, we let Vf = —We for e G (p{f) n E'. Otherwise, 
if ip{f) n if' = 0, Vf can be chosen arbitrarily, say Vf = 1. Finally for each 
e G D' , either e G if' or e G ipif) for some / G F'. In the first case, we assign 
Ue = —We and, in the second case, we let Ue = —Vf. It can now be easily seen 
that the u, v, w vectors so defined are a solution to the homogeneous system (1), 
establishing that the columns of A are linearly dependent. □ 
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For any given framework incidence matrix U, let P 2 (b') denote the polyhedron 
defined by the system of equations with constraint matrix of problem IP2 and 
integer right hand side vector together with nonnegativity constraints on all 
variables. Similarly, let Pi{b) denote the polyhedron defined by the system of 
equations with constraint matrix of problem IPl with integer right hand side 
vector b, and nonnegativity constraints on all variables. Proving the following 
lemma is straightforward. 

Lemma 8. Let b be an integer vector such that Pi{b) is nonempty. Then, there 
exists an integer vector b' such that (x,y) is an extreme point of Pi (b) if and 
only if {x,y,x + Uy) is an extreme point of P 2 {b'). 

Note Lemma 8 holds for the incidence matrix U of any framework. In par- 
ticular, if we choose the framework to be simple, then combining Theorem 7, 
Lemma 8, and Theorem 1 establishes the unimodularity of the constraint matrix 
of problem IPl. 

Corollary 9. The constraint matrix of problem IPl is unimodular. 

In conclusion, we have established that the constraint matrices of both for- 
mulations of the maximum resonant set problem are unimodular. In particular, 
for IP2 we showed here that the result holds when the framework is simple. Al- 
though it is not necessary for our development, it can be shown with additional 
effort that Theorem 7 is also valid for an arbitrary framework. 

6 Minimum Weight Cut Cover Problem 

Hansen and Zheng [6] proposed a class of cuts for benzenoids systems and defined 
an optimal covering problem based on these cuts. In this section we apply their 
definitions to a larger class of plane graphs. The proofs of results established 
by Hansen and Zheng [6] extend directly to the more general version considered 
here, so we will omit them. As before, {V, E, F) continues to denote the plane 
map of a bipartite and 2-connected planar graph G = (V,E), where the node 
set is bipartitioned as P = Pi U P 2 • We also assume in this section that G has a 
perfect matching. 

Definition 10. A cut of {V,E,F) is a simple (possibly closed) plane curve c 
such that 

(1) c intersects G only at some edge subset Ec of E, (i.e., c does not go through 
any nodes); 

(2) the subgraph Gc = (V,E\ Ec) is disconnected such that the nodes of each 
edge in Ec belong to different components of Get and 

(3) all nodes of edges in Ec in any component of Gc belong to either Pi or P 2 . 

Definition 11. A cut cover of (V,E,F) is a set of cuts C such that each face 
in F is intersected by at least one cut in G. 
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Definition 12. Let M C E be a perfect matching for (V, E, F). The weight me, 
with respect to M , of a cut c is me = \Ec H M\ and the weight m{C) of a cut 
cover C is m{C) = 

Hansen and Zheng [6] proved the following two theorems. 

Theorem 13 ([6]). The weight of a cut is the same for all perfect matchings. 

Thus, for any cut cover C of {V,E,F), its weight m(C') is also independent 
of the perfect matching. Let cov(V, E, F) denote the value of a minimum weight 
cut cover for {V,E,F). Then 

Theorem 14 ([6]). For all {V,E,F), res{V, E, F) < cov{V, E, F). 

Hansen and Zheng [6] conjectured that the above inequality is satisfied as 
equation by all benzenoid systems. In what follows, we sketch our proof that this 
conjecture holds true for the class of plane maps we consider in this paper. To 
accomplish this, we will define a minimum cost network flow problem associated 
with the minimum weight cut cover problem. The directed graph of this flow 
problem is based on the geometric dual graph of the plane map (V, E, F) . 

Let G* = {FU {t}, E*) denote the dual graph of (V, E, F), where t denotes 
the external face of G. We now transform G* into a directed graph. Let (p denote 
the clockwise oriented framework of (V,E,F) and let {/, z} be an edge in E* . 
Without loss of generality we can assume / G F. If the edge that separates faces / 
and i in (V, E, F) belongs to p{f), then {/, i} becomes arc (/, i). Otherwise, {/, i} 
becomes the arc (i,f). Next, for each node / G F, we perform a node splitting 
transformation to obtain nodes /i G Fi and /2 G F2 connected by the arc 
(/i) 72) G Ap, where Fi and F2 are two copies of F and is the set of resulting 
arcs connecting nodes in these two sets. Outgoing (incoming) arcs of node / 
become outgoing (incoming) arcs of node /2 (/i). Let D = (FiUF2U{t}, ApLlAp) 
denote the resulting directed graph, where is the set of arcs connecting nodes 
in different faces. Observe that each directed cycle in F = (FiUF2U{t}, ApLlAp) 
corresponds to a cut of (V, E, F). 

We now define a minimum cost circulation problem on the directed graph 
D. A lower bound of 1 is imposed on the flow value of each arc in Ap (this 
enforces that all faces in F are covered by cuts). Flow on all remaining arcs is 
required to be nonnegative. The cost assigned to each arc in Ap is zero. Finally, 
let p G { 0 , 1 }® be the incidence vector of a perfect matching for G = (V,E). 
The cost of arcs in Ap is given by the corresponding entry of the vector p. Let 7 
denote the optimal value of this network flow problem. Since the right hand side 
data for this network flow problem is integer, there exists an optimal solution 
to the problem that is integer. As integer solutions can be interpreted as cut 
covers, it follows that cov{V, F, F) < 7. We next prove that 7 < res{V, E, F). 

Let U be the incidence matrix of the counterclockwise oriented framework of 
{V, E, F) and let a G { — 1 , 0 , 1 }® be defined as a = {U — U)l. We note that the 
nonzero entries of a correspond to the edges in the perimeter of (V, E, F) and 
they alternate in sign along the perimeter. 
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The dual of the network flow problem can be written as follows: 
minimize y 



subject to Uu— Uv — aw + x = p 


(2) 


y — u + V = 0 


(3) 


y, u,v,w,x > 0 


(4) 



where y,u,v € IR^, w G IR, and x G IR'®. 

The proof is completed by showing that the x, y components of any feasible 
solution to the above dual problem are also feasible for the linear relaxation of 
problem IPl. The key observation here is that premultiplying constraints (2) by 
K (the node-edge incidence matrix of G) and rearranging terms gives 

Kx + R{u — v) = Kp = 1 

Using equations (3), we substitute y for (u — v) and obtain Kx + Ry = 1, the 
equation constraints of problem IPl. Thus, the strong duality result follows. 

Theorem 15. For all (V,E,F), res{V, E, F) = cov{V, E, F). 

The arguments in the proof of Theorem 15 also show that res{V, U, F) and 
cov(V, E, F) can be computed in strongly polynomial time using a network flow 
algorithm. 
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Abstract. In this paper we consider a variant of the betweenness prob- 
lem occurring in computational biology. We present a new polyhedral 
approach which incorporates the solution of consecutive ones problems 
and show that it supersedes an earlier one. A particular feature of this 
new branch-and-cut algorithm is that it is not based on an explicit integer 
programming formulation of the problem and makes use of automatically 
generated facet-dehning inequalities. 



1 Introduction 

The general Betweenness Problem is the following combinatorial optimization 
problem. We are given a set of n objects 1,2, ...,n, a set B of betweenness 
conditions, and a set B of non-betweenness conditions. Every element of B (of 
B) is a triple {i,j, k) (a triple (i,j, k)) requesting that object j should be placed 
(should not be placed) between objects i and k. The task is to find a linear order 
of all objects such that as few betweenness and non-betweenness conditions as 
possible are violated, resp. to characterize all orders that achieve this minimum. 
If violations are penalized by weights, we call the problem of finding a linear order 
minimizing the sum of weights of violations Weighted Betweenness Problem. This 
problem is A/T’-hard in general. 

In this paper we consider a special variant of this problem occurring in com- 
putational molecular biology, namely in the physical mapping problem with end 
probes. For the purpose of this paper we do not elaborate on the biological back- 
ground, but refer to [4]. We define the problem simply as follows. We are given 
a set of m so-called clones i G {1,2,..., m} to each of which two so-called end 
probes i* and are associated. These n = 2m probes are numbered such that 
= 2z — 1 and = 2i. Depending on the data, we have for every pair of a clone 
i and a probe j G {1, 2, . . . , n} \ {i*, either a betweenness condition i^) 
or a non-betweenness condition i^). Violation of a betweenness condition is 
penalized with cost Cp, and violation of a non-betweenness constraint receives a 

R. E. Bixby, E. A. Boyd, and R. Z. Rfos-Mercado (Eds.): IPCO VI 
LNCS 1412, pp. 213-228, 1998. © Springer-Verlag Berlin Heidelberg 1998 
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penalty of . The problem is then to find a linear order of the probes minimizing 
the sum of penalties for violated constraints. 

This Weighted Betweenness Problem can also be stated in a different version: 
A 0/1 matrix A € {0, 1}™^” has the consecutive ones property (for rows) if the 
columns of A can be permuted so that the I’s in each row appear consecutively. 
For a 0/1 matrix B G {O,!}™^” having the consecutive ones property let n® 
(n^) denote the number of I’s (of O’s) that have to be switched to transform 
A into B. For given nonnegative numbers Cp and Cp, we define the Weighted 
Consecutive Ones Problem as the task to find a matrix B with the consecutive 
ones property minimizing CpU® + c^n^. This problem is known to be A^-hard 
[2]. All column permutations tt of a feasible matrix B so that the I’s in each row 
of B^ appear consecutively can be found in time linear in the number of I’s in 
B by a so called PQ-tree algorithm [1]. 

For our discussion, we assume that the data is given as clone x probe 0/1- 
matrix A where a^jt and are fixed to 1. The other entries are obtained from 
some experiment where an entry = 1 gives rise to a betweenness constraint 
(i^,j,i^) and an entry = 0 corresponds to a non-betweenness constraint 
(P,j,i^)- A solution of the Weighted Betweenness Problem corresponds then 
to a solution of the Weighted Consecutive Ones Problem with the additional 
constraint that in some column permuted matrix B~^ (in which the I’s in each 
row appear consecutively) the first and the last 1 in each row i correspond to 
the end probes of clone i. 

The Weighted Consecutive Ones Problem models the biological situation 
when the information that the probes are the ends of clones is missing [8]. Note 
that by introducing artificial variables the Weighted Consecutive Ones Problem 
can easily be transformed to a Weighted Betweenness Problem. 

The paper is organized as follows. Section 2 discusses our previous approach 
which could already be applied successfully. An improved approach is presented 
in section 3 leading to the definition of the betweenness polytope. This polytope 
is then studied in the following section. Separation in the context of a branch- 
and-cut algorithm to solve the betweenness problem to optimality is the topic 
of section 5. Computational results conclude this paper. 

2 A First IP Model 

Our previous computational approach to the Weighted Betweenness Problem was 
based on a different model. We state it here mainly for a comparison and for 
introducing some notations. In the following we will introduce 0/1 variables in- 
dicating whether a betweenness or non-betweenness constraint is violated. Since 
a betweenness condition (i,j,k) is violated if and only if the non-betweenness 
condition (i,j,k) is satisfied, we can just complement variables and will speak 
only about betweenness conditions in the following. The objective function co- 
efficients of the variables will then give the preference if the condition should be 
satisfied or violated. 
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Let m be the number of clones and n be the number of probes. In our special 
setup we have n = 2m and we have n — 2 betweenness constraints for 

every clone i. We write, for short, {i,j) for the betweenness constraint (z*,j, z^), 
and call (i,j) a clone-probe pair. The set Bm of all possible clone-probe pairs is 

Bm ■■= \ ^ < i < m, \ < j < n, j ^ i\ j ^ z'*}. 

Obviously, \Bm\ = 2m{m — 1). 

We will develop an integer programming formulation with two types of vari- 
ables. For every ordered pair (z, j) of probes, we introduce a 0/1 variable z/y 
which has value 1 if and only if z precedes j in the order tt. With every clone- 
probe pair (z, j) G Bm, we associate the 0/1 variable Xij which has value 1 if and 
only if constraint (z*,j, z^) is met in the order tt. Equivalently, Xij = 0 if and 
only if the non-betweenness constraint (z*,j, z^) is met. 

To ensure that the variables pij encode a linear order tt of the probes, the 
constraints of the IP formulation of the linear ordering problem have to be met, 
i.e., they have to satisfy 

Hij Uji = 1, for all 1 < z < J < n, 
iJij + Vjk + Vki < 2, for all triples 1 < z, j, fc < n. 

To ensure that the xij count violations of the betweenness and nonbetweenness 
constraints, we have to add further inequalities. To force a Xij, (i,j) G Bm to 
be 0 if and only if (z*,j, z^) is violated (or (z‘,j, z^) is satisfied), we add 

^ij — Vji^ T Ujp J 
— Vi^j T ypj J 
Xij ^ yi^j yji^ T 1, 

^ij ^ yi^j T yji^ 1- 

Thus, Xij is 1 if and only if z/jtj = yjih . 

We do not discuss the objective function here. Due to the positive param- 
eters Cp and Cp we can omit some of these inequalities depending on whether 
Xij corresponds to an original betweenness or non-betweenness condition, and 
moreover, we only have to require 



0 < x,j < 1 . 

The objective function will force these variables to have integer values if the 
linear ordering variables are integer. Note that the objective function is zero on 
the linear ordering variables. 

With every feasible solution of the problem, corresponding to a permutation 
TT of the probes, we associate 0/1-vectors G {0, l}"("~i) and x'^ G {0, 1}I^’"I 
with 

, ^ _ J 1 if z precedes j in the order tt, 

0 otherwise. 
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and 

^ f 1 if constraint (t*. j.i^) is met in the order tt, 

= otherwise. 

The polytope Pj^bw associated with the instance of the Weighted Betweenness 
Problem is the convex hull of all possible vectors ) , 



PjMBW — conv 




7 T is a permutation of the probes 



})■ 



Previous computations were based on partial descriptions of this polytope (see 

[4]). 



3 Modelling Without Linear Ordering Variables 



The formulation of the preceding section is somewhat redundant. Namely, let 
(^) G Plobw 1 } 6 ™ - 4 m incidence vector of the Weighted Between- 

ness Problem with n = 2m probes and the set Bm of betweenness constraints. 
Obviously x can be retrieved from ip, because 

Xij = l^-tpitj = Ipjih. 

Conversely, for a given x there exist one or more feasible settings of ip. These 
settings cannot be obtained directly but in linear time by application of the 
PQ-tree algorithm [1] . For a given x we define for every clone i three sets in the 
following way. 

Sii = {j I Xij = 1} u {z*}, 

Sn = {j I Xij = l}U{z'‘}, 

Sis = {j I Xij = 1} U {z‘} U {z^}. 

The feasible settings of the linear ordering variables ip correspond to all 
permutations of {!,..., zz} where the elements of every set introduced above 
occur in consecutive order. 

We now define the projection of P^bw X variables 

|x I there exists ip such that G Plobw 

It can easily be tested if a 0/1- vector x is contained in P^. Namely if not, 
i.e. if X G {0, X ^ Pbw^ then with the sets defined above for the 

clones, the application of the PQ-ixee algorithm would yield the result that 
no permutation exists in which the elements of the sets appear as consecutive 
subsequences. Otherwise, if x G i'^^en the PQ-tree provides all consistent 

permutations. 

Because a feasible linear ordering can be derived from a x G Pbw ^'^d the 
objective function is zero for the linear ordering variables, they can be omitted. A 




isvv = conv 
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solution of the Weighted Betweenness Problem for physical mapping is obtained 
by solving 



max (F X 
^ 

X £ 

Now we want to derive an integer programming formulation of this problem 
from an integer programming formulation of the Weighted Consecutive Ones 
Problem. 

For each y the sets S'ij, can be written as rows of a matrix G {0, i}3mx2m^ 
For 1 < z < TO and 1 < j < n and 1 < fc < 3 an entry j of has the 

value 1 if and only if j 

Proposition 1. y G Pbw */ if has the consecutive ones property 

for rows. 

Proof. Clear because of the construction of the sets Si^ . 

Corollary 2. Each integer programming formulation of the Weighted Consecu- 
tive Ones Problem leads to an integer programming formulation of the Weighted 
Betweenness Problem for physical mapping. 

Proof. Observe that the entries of are constant or equal to some correspond- 
ing entries of y. Hence, it exists a (simple) linear transformation from y to 
which can be used to substitute in an IP formulation of the Weighted Con- 
secutive Ones Problem, yielding an IP formulation of the Weighted Betweenness 
Problem. 

An IP formulation of the Weighted Consecutive Ones Problem can be derived 
from a theorem of Tucker [15]. In this paper it is shown that the 0/1 matrix M 
has the consecutive ones property for rows if and only if no submatrix of M is 
a member of {Mi}, where {Mf\ is a given set of 0/1 matrices. Now for each 
submatrix of M and each matrix Mi with the same size it is easy to derive an 
inequality, which is violated if and only if the submatrix is equal to Mi. The 
set of all these inequalities gives an IP formulation of the Weighted Consecutive 
Ones Problem. 

Using the linear transformation from y to M^ we obtain an IP formulation 
of the Weighted Betweenness Problem for physical mapping. For to = 2 the 
nontrivial inequalities of the formulation are 



2x12* — 2 Xi2*i — 3^21* — 3^21** — ^5 

and further inequalities which can be obtained by the symmetry operations 
described in section 4.2. Note that these two classes of inequalities do not define 
facets of PgiY- 
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4 The Polytope 

In this section we will investigate some properties of the polytope (see also 
[13] for a more detailed discussion). In particular we are interested in exhibiting 
classes of facet-defining inequalities. 

4.1 Dimension and Lifting 

We first determine the dimension of address the question of trivial 

(node) lifting of facets. 

Proposition 3. Let m > rn >2. An inequality < go for Pjf^, obtained by 
trivial lifting from an inequality /^a; < /o which is valid for is valid for 

TDm 

^BW- 

Proof. Let a binary vector y € Pj^ be given. Since by removing the components 
Xij with (i,j) G Bm \ Bjj^ from y one obtains a vector y which is a feasible 
incidence vector of proposition follows. 

We can also compute the dimension of a face of Pff^ which is induced by an 
inequality resulting from trivial lifting of an inequality of Pff^. 

Theorem 4. Let f'^x < fo be valid for P]!f^, and let F = {x G Pj!ft^ \ f^x = fo} 
be the induced face with dimF > 0. Let m > m and let g^a; < go be obtained 
from f'^x < fo by trivial lifting. Let G = {x G Pjf^ \ g^x = go}. Then 

dimG — dimF = \Bm \ — \Bm\ = 2m{m — 1) — 2rn{fh — 1). 

Proof. Obviously, it is sufficient to show the theorem for m = to -I- 1. Let 
TO = TO -|- 1 and k := dimi^ > 0. Then there exist fc -I- 1 permutations ttq, . . . , 
and corresponding incidence vectors with y’^* G F for i = 0, ... ,k and 
arankjy’^®, . . . ,y’^*'} = A: -I- 1. W.l.o.g. by symmetry considerations we may as- 
sume that 

7r(C^(i‘) < 7r(C^(t^) for 1 < z < to, 

for I <i < j <m. 

We now construct /c-|-l-|-|,8m| — |'8m| = A:-|-1-|-4to permutations po, . . . , Pk+Am 
with y'’* G G for z = 0, . . . , fc -I- 4to and arank{y'’“, . . . , yCfc+4m} = k + 4to -I- 1. 
For 0 < I < k set 

Pi = (7ri(l),...,7ri(2TO),TO*,TO'*) 

The remaining permutations Pk+u ■ ■ ■ , Pk+Am are obtained from tto by insert- 
ing TO* and TO** at different positions. For 1 < ? < to we define the permutations 
as 

Pk-2+3i = (7To(l), • . .7ro(7T(7^(/‘) - l),TO*,;*,TO**,7ro(7r(C*(Z‘) -h 1), . . .7 To(2to)), 

Pk-l+3l = (7To(l), • . .7ro(7T(7^(/‘) - l),TO'*,;‘,TO*,7To(7r(C*(?‘) -h 1), . . .7 To(2to)), 

Pk+3l = (7To(l), • . .7ro(7T(7^(/‘) - l),Z*,TO*,TO**,7To(7r(C*(?‘) -h 1), . . .7 To(2to)), 

Pk+3m+l = (7To(l),.. .7ro(7r(C^(l'*) - l),TO*,;**,TO**,7ro(7r(C^(l'*) -k 1),.. .7 To(2to)). 
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The following properties of xfj) ^ + 1 < I < k + 4m, (i,j) € Bm \ B^ are 
needed to show that the vectors are affinely independent. They result in a 
straightforward way from the definition ofp/,fc+l<^<A: + 4m. 

For I < r < fn it holds that ~ since 7T(7^(m*) < < 

TTo^ir*) < TTo^ir'^). _ 

By similar arguments, it holds for I < r <m that 

Pk-2 + 31 _ Pk-l + 3l _ Pk + 31 _ Q 

X-ij Xij Xij 

for 

Moreover, for 1 < ? < m, we have 

Pk-2 + 3l _ Pk-l + 3l _ Pk + 31 _ Q 

Xlm^ 



and 



Pfc-2+3! _ Pk-2+31 _ 1 

Pfc-l + 3i _ Pfe-1+3Z _ 1 



Pk+31 _ Pk + 31 _ 1 

Xlm^ Xlm^ 



In addition, for 1 < r,l < m 



Pk + 3m + l 

Ayyirk 




for I = r, 
otherwise, 



and, for 1 < r, I < m, 



Pk-2 + 3l _ Pk-l + 3l _ Pk + 31 _ Q 

A^nr^ '^mr^ -^mr^ 



In the following table, the rows are the incidence vectors ^ 

i < 4m, restricted to the variables in Bm \ Bm- The vectors are partitioned 



into m blocks of coefficients xl 

■^ml^ 5 • • • 1 ' 



Pk+i 



X/ 



Pk + i 



Xm/^% and the remaining coefficients 





Pk+i Pk+i Pk+i 




Pk + i yPk + i Pk + i 


x^'‘+: ■■■ x^^’+i 

A^in 


i = 1 
i = 2 
i = 3 


i-i 


0 


0 


0 








0 


0 


i = 3m — 2 
i = 3m — 1 
i = 3m 






1-7 


0 


i = 3m + 1 
i = 4m 




% 




I 
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The system has full rank and since it holds that, for 0 < I < k, 

\ 0 for {l,j)eBm\Brn, 

the vectors t • • • > are affinely independent. 

It is easy to see that G G for z = 0, . . . , A: + 4m. Hence, dim G > k + 4m. 
On the other hand, dim G < k + 4m, because by the lifting operation only 4m 
variables are added (for m = to + 1). Therefore, dimG = dimT’ + 4 to. 



From these results we obtain that trivial lifting preserves the facet-defining 
property and that Pj^ is full-dimensional. 

Corollary 5. Let to > to > 2. An inequality < go for Pjf^, obtained by 
trivial lifting from a facet- defining inequality /^a; < /o of is facet- defining. 



Corollary 6. For to > 2 



dimig^ = 2 to(to — 1), 
i.e., Pj^ is full-dimensional. 

Proof. P^w full-dimensional. By application of Theorem 4 with F = P^^y^/ and 
G = Pjff^ the corollary follows. 

4.2 Small Instance Relaxations 

Because facet-defining inequalities of are trivially liftable, we can obtain 
from linear descriptions of polytopes associated with small instances of the prob- 
lem, say for 2 < TO < 4, relaxations of such a relaxation 

small instance relaxation [3,6]. 

In order to characterize the symmetry properties of Pj!fl}y we use the following 
notation. Let v G IR^®’"^ The entries of v can be placed in an to by to clone- 
clone matrix v of 2-dimensional vectors. The rows and columns of the matrix 
correspond to the clones, and an entry Vij G IR^ of v is defined for z yf j as 
Vij = i.e., an entry ij refers to the relations of the end probes obtained 

from clone j with clone z. 

Now, the set of incidence vectors in Pjf^ can be partitioned into equivalence 
classes with respect to the following two operations. First, for y G Pjf^ an arbi- 
trary permutation of the clones is allowed. This corresponds to a simultaneous 
permutation of the rows and columns of the associated clone-clone matrix y. Sec- 
ond, for an arbitrary clone j its end probes can be reversed. This corresponds 
to reversing in column j of the clone-clone matrix the entries of (^o‘ ) to 

^ ' 'X.ijt ' 

for all 1 < z < TO, z yf J. 
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We used the algorithms for facet enumeration discussed in [5] to get more 
insight into the facet structure of for to < 4. It is clear that also the 
facet-defining inequalities of can be assigned to equivalence classes. 

has 7 vertices and is completely described by 3 equivalence classes of 
facets. Pbw vertices and 241 facets in total which can be partitioned 

into 16 equivalence classes of facets. They are displayed in Table 1 (/* denotes 
the clone-clone matrix representation of /*). Observe that the (lifted) facets of 
Pbw are among the facets of P^w < /q to < /§). 

The computation of the complete linear description of P^w which has 9,197 
vertices was not possible in reasonable time. However, by an algorithm for paral- 
lel facet enumeration (see [5]) we found more than 1, 16 • 10^ (!) different equiva- 
lence classes of facet-defining inequalities, yielding a lower bound of 4.4 • 10® for 
the number of facets of P^w 



4.3 Cycle Inequalities 

We use the following notation. For j G {1, . . . ,n}, c(j) denotes the clone from 
which the probe j is extracted, i.e., j = c{j)* or j = c{j)^, and by q{j) we 
denote the second probe q{j) which the clone c(j) defines. Of course, we have 

= j- 

A class of valid inequalities can be motivated by the following observation. 
Suppose we have n = 2m betweenness constraints (A*, 1, A^), . . . , (i„*, n, t„^), 
such that every probe j, j G {1, ... ,n}, is exactly once the probe between the 
ends of a clone ij yf c(j). Then for any permutation tt of the probes at least two 
betweenness constraints are violated, because the first probe 7 t( 1) and the last 
probe 7 t(2to) are not between other probes. Hence, the inequality 

n 

^ ' ^ijj ^ ^ ~ 2, * j yf c(j) (1) 

i=i 

is valid for The following cycle inequalities are a special case of these 

inequalities. In particular, they are facet-defining and they generalize the facet 
X < /g of Table 1. 

Theorem 7. Let Dn be the eomplete directed graph on n nodes corresponding 
to all probes, and let R be a cycle in Dn with the property that if k in V{R), 
then q{k) ^ V{R). Then 



^ ^ ^c(k)l T ^c(l)q{k) ^ 2|77| 2 (2) 

(k,l)eR 



defines a facet of Pffw- 
Proof. Let us assume that 



T 

a X = 



^ ^ aij Xij ^ b 



( 3 ) 
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Table 1. Facet-defining inequalities /* a; < /g of P^w 



II 


/ * (1) (0)\ 
VI/ VO/ 

(?) * (?) 


, /o = 2 


II 

(N 


f * n 

VO/ VO/ 

(-G) * (?) 


o 

II 


1 


U?) (?) * ) 




1 


o o 
* 


1 



/" = 



( * (-\)Q\ 
(:1) * Q 
Q *; 



/o =0 



/" = 



I * Q(:l)\ 
(^) * (^) 
V(:l)(o) * ) 



/o =2 





( * (?) (:?)^ 




( * (?) (:?)' 


\ 


/" = 


(?) * (?) 


, /o = 2 /® = 


(:?) * (?) 


, /o = 2 




U?)(^o^) * ) 




1 (?) (:?) * , 


/ 




( * (?) C?)^ 


1 1 


( * (?) (-V)\ 




r = 


(?) * (:?) 


II 

00 

(N 

II 


(?) * (-/) 


, /I = 4 




U?)(=?) * ) 


1 1 


U?)(") * / 





f = 



n\ (i\ \ 
\o) \o) \ 






(?) * (?) 


,/o 


U?) (?) * ) 




( • ffl (‘A 


(-G) * ( 


-/) 



fio ^ 



( * (0 (:1)\ 
(A) * (^) 

V a) (:^) * ; 



/o° = 2 



/o“ =2 = 









V(-o^)(-\) * J 

( * U)U)\ 

( 4 ) * (-^a) 

V (-a) (-a) * 

( * (-^)(-\)\ 
(A) * (f) 
V(-\)(f) * ; 



= 2 f^ = 



/o® = 6 



f'W ^ 



( * (-\)(-\)\ 
(-\) * (.y 
V(ioa) * 

( * (-\)(-\)\ 
(-\) * (- 0 ^) 
V(-\)(-o^) * 

( * G) (-G)\ 

(-°0 * (?) 

V (?) (-0^) * ; 



/o" = 2 



/o" = 2 



/o® = 2 
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is facet-defining for such that the face induced by (2) is contained in the 

facet induced by (3). We show that (3) is a positive multiple of (2). 

W.l.o.g. we set R = {(!*, 2‘), (2*, 3‘), . . . , (m‘, 1*)}. Then (2) becomes 

m— 1 

^ ^ “ 1 “ ^ ( 4 ) 

i=l 

Figure 1 displays the graph belonging to (4) for m = 4. 




Fig. 1. Cycle inequality with 4 clones 



First we show that all coefficients are constant for the clone-probe pairs 
(i,j) corresponding to the nonzero coefficients in (4). 

Let 



7Ti = (l'*,2‘,3‘,2^,...,i‘,(i- 

7T2 = (2‘, 3‘, 2^ . . . , (z - 1)\ . . . , m‘, (m - l)^ l‘). 

It is easy to see that and satisfy (4) as equation. Thus 
This yields ai 2 ‘ = Q 2 ih. 

Because R is invariant under the cyclic permutation 1— >2— >2— >1, 
the same operations applied to tti and 7T2 imply that 



023* — 032** J ■ ■ ■ ^ Oml* — Olm** ■ 



( 5 ) 
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Analogously, using the permutations 

7T4 = (2‘, 3‘, 2^ . . . , {i - l)^ . . . , m‘, (m - 1)\ l‘, l'*) 

we obtain 

Oi2t + 021'“ = ®23* + 032'* 5 • • ■ ) Om-lm* + “ ®ml* + Olm'* ■ (6) 

With (5) and (6) we have shown the first claim. 

Now we show that = 0, for all remaining coefficients. For this purpose we 
define the following m — 1 permutations, which only differ in the position of 1^. 

Pi = (l‘,2‘,3‘,...,(m- 1 )‘,to*,1'*,2'*,...(to- 

P 2 = (1*, 2*, 3‘, . . . , (m - 1)*, 1^, m\ 2^*, ... (to - 1)'*, to^), 

Pm-i = (1‘, 2‘, 3‘, . . . , (to - 1)*, to‘, 2^ ... (to - 1)", m^). 

Again, the associated incidence vectors satisfy (4) as equation. The equations 
g^T^Pi _ ^T^pi+i £qj. 1 < j < 774 _ 2, give 

= -a(^+i_qih, for 1 < z < TO - 2. (7) 

With the permutations (which only differ in the position of 1^) 



II 

0 ^ 


, (to — 1)*, TO*, 1*®, 


1*,2'®,. 


. . (to — 1)^, 


TO*®), 


II 

0 ^ 


, (to — 1)*, 1*®, TO*, 


1*,2*®,. 


. . (to — 1)^, 


TO*®), 


,_i = (2^1^3^ 


. . . , (to — 1)*, TO*, 


1*,2*®,. 


..(to - 1)'®, 


TO*®), 



we obtain analogously that 

= a(^+i_qih, for 1 < z < to - 2. (8) 

With (7) and (8) we have shown that 013* = a^ih = 074* = 041^ = . . . = 

Applying cyclic permutations to pi and rz, for 1 <z<to— 1, we finally 
obtain that all coefficients are zero for all zero-coefficients in (4). 

Hence, the left hand side of (3) is a multiple of the left hand side of (4). By 
the properties of (3) and (4), (4) induces that same facet as (3). 

5 Separation Procedures 



In our computations, we do not use the IP formulation presented above, because 
on the one hand it is fairly complicated and on the other hand the inequalities 
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do not define facets in general. Rather, we proceed as follows. As discussed 
before, we can check feasibility of an integer vector x* by making use of the 
PQ-tree algorithm. If x* is feasible, then the PQ-tree algorithm also generates 
all optimal solutions tt*. Otherwise, if x* is not feasible (but binary), then one 
can construct a cutting plane, which is satisfied by all 0/1-vectors different from 
X* . Let P = {i\ X* = 1} and Z = {i\ x* = 0}, then 

< |P| - 1 

ieP i&Z 

is a cutting plane with the desired properties. 



5.1 Separation of Cycle Inequalities 

The separation of the cycle inequalities can be done by a shortest path algorithm. 
Let X* be an LP solution and = {Vn, A„) be the complete directed graph on 
n nodes with edge weights 



Then it is easy to see by the following transformation that a cycle with weight 
less than 2 corresponds to a violated inequality. Let y* = 1 — x*. Then x* violates 
a cycle inequality (4) if and only if y* violates an inequality 

+ ^c{j)q{i) P 2 

{i,j)eR 



which is true if and only w{R) < 2 for the cycle R in = {Vn, A„). 



5.2 Separation of Small Instance Relaxations 



We also implemented separation heuristics for inequalities of small instance re- 
laxations. 

Let an LP solution x* G IRI®’”! and a facet-defining inequality < go be 
given. For a permutation ct of {!,..., m} we define 



C(a) = 



E E 

1=1 



f 9a{i)a{jy \ ^ 



It is clear that the set {a \ C{a) > go} gives all violated inequalities among 
all inequalities which are (by means of relabeling of the clones) equivalent to 
g"^x < go- Thus the separation problem for equivalent inequalities (relative to 
relabeling of the clones) is a quadratic assignment problem in which all entries 
are 2-dimensional vectors. 
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Because an inequality x < go of a small instance relaxation results from 
trivial node lifting of an inequality /^a; < /o which is facet-defining for 
TO < TO we have 

C{a) = f^ 

Since to is rather small (to < 4) the quadratic assignment problem is com- 
putationally tractable and can effectively be solved by heuristics; in our imple- 
mentation we use a Grasp procedure similar to [12]. 

In order to separate all inequalities of one equivalence class this separation 
procedure has to be executed for at most 2™ different inequalities ffx < /g 
which are equivalent relative to the end probe reversing symmetry. 

6 Computational Results 

In our branch-and-cut computations we used ABACUS [14,9,10] with LP solver 
CPLEX [7] and an implementation of the PQ-tree algorithm [11]. 

In [4] we used a set of 150 randomly generated problem instances with 42 to 70 
probes which resemble instances occurring in practical applications. To generate 
the data the clones were randomly distributed across the chromosome. We used 
a coverage (which gives the average number of how often a single point along the 
chromosome is covered by a clone) varying from 3 to 5, a false negative rate of 
10%, and a false positive rate varying from 0% to 5%. To create a false positive 
or false negative, a coin was flipped at each entry with the given probability. 
Across experiments with varying coverage, the clone length was held constant. 

Our computational experiments show that the new approach clearly super- 
sedes our previous approach. Table 2 compares our previous approach (using 
linear ordering variables) with the new model described here. The table displays 
the average CPU-times ttot (in hh:mm:ss, on a Sun Sparc IPX), the average 
number of nodes of the branch-and-cut tree the average number of cutting 
planes which are added to the linear program ricut and the average number of 
LP reoptimizations nip to solve the problem instances. 



Table 2. Average values for 150 problems. 



Model 


itot 


'^sub 


'^cut 


'^Ip 


Linear ordering based 


0:44:32 


3.2 


22197.8 


201.5 


PQ-tree based 


0:01:28 


1.1 


1022.0 


6.3 



In our new approach the following strategy for separating a fractional LP 
solution turned out to be favorable. If by exact enumeration of the nontrivial 
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facet classes of not enough cutting planes are found, we execute the heuris- 
tics of section 5.2 for a small subset (less than 15) of all equivalence classes of 
facet-defining inequalities of Pgy^. If this separation fails, then we separate the 
cycle inequalities. 

To improve the performance of our branch-and-cut algorithm we developed 
algorithms for executing the separation procedures for different classes of facets 
in parallel. In addition, we studied by computational experiments two important 
questions in detail: 

— Which and how many classes of facets should be used for separation? 

— Given a large set of cutting planes generated by the separation heuristics, 

which ones should be selected and should be added to the LP? 

Concerning the first problem, we observed that facet classes with a large 
number of roots (feasible incidence vectors contained in the facet) are the most 
important ones. The best solution to the second problem is to use cutting planes 
whose normal vectors have a small angle with the objective function gradient. 
Detailed computational experiments using small instance relaxations in parallel 
branch-and-cut algorithms are reported in [3,6]. 

For the Weighted Betweenness Problem we tested 25 different strategies on 
8 randomly generated instances with the following characteristics: 125 clones 
(=250 probes), a coverage of 4, a false positive rate of 5% and a false negative 
rate of 10%. With our best strategy we could solve the 8 instances on a parallel 
machine with 8 Motorola Power PC processors (7 of them reserved for executing 
the separation heuristics) in 2:01 :34h total time. 
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Abstract. We develop an algorithm for solving a linear diophantine 
equation with lower and upper bounds on the variables. The algorithm 
is based on lattice basis reduction, and first finds short vectors satisfying 
the diophantine equation. The next step is to branch on linear combi- 
nations of these vectors, which either yields a vector that satisfies the 
bound constraints or provides a proof that no such vector exists. The 
research was motivated by the need for solving constrained linear dio- 
phantine equations as subproblems when designing integrated circuits 
for video signal processing. Our algorithm is tested with good result on 
real-life data. 

Subject classification: Primary: 90C10. Secondary: 45F05, 11Y50. 



1 Introduction and Problem Description 

We develop an algorithm for solving the following integer feasibility problem: 

does there exist a vector x G such that ax = oq, 0 < x < u? (1) 

We assume that a is an n-dimensional row vector, u is an n-dimensional column 
vector, and that oq is an integer scalar. This is an NP-complete problem; in 
the absence of bound constraints, it can be solved in polynomial time. The 
research was motivated by a need for solving such problems when designing 
integrated circuits (ICs) for video signal processing, but several other problems 
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can be viewed as problem (1), or generalizations of (1). One such example is 
the Frobenius problem that was recently considered by Cornuejols, Urbaniak, 
Weismantel and Wolsey [4]. The instances related to video signal processing 
were difficult to tackle by linear programming (LP) based branch-and-bound 
due to the characteristics of the input. In order to explain the structure of these 
instances we briefly explain the origin of the problem below. We also tested our 
algorithm with good results on the Frobenius instances of Cornuejols et al. 

In one of the steps of the design of ICs for video signal processing one needs 
to assign so-called data streams to processors. A data stream is a repetitive set 
of arithmetic operations. The attributes of a data stream are the starting time 
of the first execution, the number of repetitions, and the period of repetition. 
One can view a data stream as a set of nested loops. The outer loop has an 
iterator zq : 0 < to < lo- The following loop has iterator zi : 0 < zi < Ii, and so 
forth. The periodicity corresponding to a loop is the time interval between two 
consecutive iterations. When constructing an assignment of the streams to the 
processors the following conflict detection problem occurs: check whether there is 
any point in time at which operations of two different streams are carried out. If 
such a point in time exists, then the streams should not be assigned to the same 
processor. Consider an arbitrary data stream /. Let i/ = (z/o, z/i, . . . , z/m)^ be 
the iterator vector of the stream. The iterator vector satisfies upper and lower 
bounds, 0 < i/ < I/. Let p/ denote the period vector and s/ the starting time 
of the stream. The point in time at which execution i/ of data stream / takes 
place is expressed ast(i/) = s/ + pji/. The conflict detection problem can be 
formulated mathematically as the following integer feasibility problem: given 
data streams / and g, do there exist iterator vectors i / and \g such that 

Sf + P/i/ = Sg + Pg igj and such that 0 < i/ < I/, 0 < ig < Ig? 

The Frobenius problem is defined as follows: given nonnegative integers 
(ai,...,a„) with gcd(ai, . . . , a„) = 1, And the largest integer uq that cannot 
be expressed as a nonnegative integer combination of oi, . . . , a„. The number oq 
is called the Frobenius number. The instances considered by Cornuejols et al. 
[4] were also hard to solve using LP-based branch-and-bound. They developed 
a test set approach that was successful on their instances. 

When solving a feasibility problem such as (1) by LP-based branch-and- 
bound, two difficulties may arise. First, the search tree may become large due 
to the magnitude of the upper bounds on the variables, and second, round-off 
errors may occur. The size of the branch-and-bound tree may also be sensitive 
to the objective function that is used. For our problem (1) an objective function 
does not have any meaning since it is a feasibility problem; as long as we either 
And a feasible vector, or are able to verify that no feasible vector exists, the 
objective function as such does not matter. The problem is that one objective 
function may give an answer faster than another, but which one is best is hard 
to predict. An objective function also introduces an aspect to the problem that 
is not natural. Round-off errors occur quite frequently for the instances related 
to the conflict detection problem, since the coefficients of some of the variables 
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are very large (« 10^). The special characteristics of these instances - some very 
large and some relatively small coefficients and a very large right-hand side value 
ag - are due to the difference in periodicity of the nested loops. This difference is 
explained by the composition of a television screen image. Such an image consist 
of 625 lines, and each line is composed of 720 pixels. Every second 25 pictures 
are shown on the screen, so the time between two pictures is 40 ms. The time 
between two lines and between two pixels are 64 /us and 74 ns respectively. Since 
the output rate of the signals has to be equal to the input rate, we get large 
differences in periodicity when the data stream corresponds to operations that 
have to be repeated for all screens, lines and pixels. Due to the large difference 
in the magnitude of the coefficients we often observe that the LP-based branch- 
and-bound algorithm terminates with a solution in which for instance variable 
Xj takes value 4.999999, simply because the hardware does not allow for greater 
precision. If one would round xj to xj = 5.0, then one would obtain a vector 
X such that ax ^ ag. It is obviously a serious drawback that the algorithm 
terminates with an infeasible solution. 

To overcome the mentioned deficiencies we have developed an algorithm 
based on the basis reduction algorithm as developed by Lenstra, Lenstra 
and Lovasz [10]. The motivation behind choosing basis reduction as a core of 
our algorithm is twofold. First, basis reduction allows us to work directly with in- 
tegers, which avoids the round-off problems. Second, basis reduction finds short, 
nearly orthogonal vectors belonging to the lattice described by the basis. Given 
the lower and upper bounds on the variables, we can interpret problem (1) as 
checking whether there exists a short vector satisfying a given linear diophantine 
equation. It is easy to find an initial basis that describes the lattice containing 
all vectors of interest to our problem. This initial basis is not “good” in the 
sense that it contains very long vectors, but it is useful as we can prove struc- 
tural properties of the reduced basis obtained by applying the algorithm to 
it. It is important to note that basis reduction does not change the lattice, it 
only derives an alternative way of spanning it. Furthermore, our algorithm is 
designed for feasibility problems. Once we have obtained the vectors given by 
the reduced basis, we use them as input to a heuristic that tries to find a feasible 
vector fast or, in case the heuristic fails, we call an algorithm that branches on 
linear combinations of vectors and yields either a vector satisfying the bound 
constraints, or a proof that no such vector exists. 

In Sect. 2 we give a short description of the basis reduction algorithm 
and a brief review of the use of basis reduction in integer programming. In Sect. 
3 we introduce a lattice that contains all interesting vectors for our problem 
(1), and provide an initial basis spanning that lattice. We also derive structural 
properties of the reduced basis. Our algorithm is outlined in Sect. 4, and we 
report on our computational experience in Sect. 5. 

We are indebted to Laurence Wolsey for stimulating discussions and for nu- 
merous suggestions on how to improve the exposition. 
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2 Basis Reduction and Its Use in Integer Programming 

We begin by giving the definition of a lattice and a reduced basis. 

Definition 1. A subset L C M” is called a lattice if there exists linearly inde- 
pendent vectors bi, b 2 , . . . , b; in K" such that 

L = {Ej=i Oijhj : aj e Z, 1 < j < 1}. (2) 

The set of vectors bi, b 2 , • . • , b/ is called a lattice basis. 

Gram-Schmidt orthogonalization is an algorithm for deriving orthogonal vectors 
bj) 1 < J < ^ from independent vectors b^, I < j < L The vectors b*, I < j < I 
and the real numbers fXjk, ^ k < j < I are defined inductively by: 

bj* = bj - J2i=\ h-jkK (3) 

f,,, = (b,rb*/(b*)^b* (4) 

Let II II denote the Euclidean length in M". Lenstra, Lenstra and Lovasz [10] 
used the following definition of a reduced basis: 

Definition 2. A basis bi,b 2 , . . . ,b; is reduced if 

\Tjk\ < ^ for l<k < j <l (5) 

and 

||b* + M,.,-ib*_i|P> |||b*_i|P/or Kj<L (6) 

The vector b* is the projection of bj on the orthogonal complement of X)i=i ®-bfc, 
and the vectors b* + and b*_^ are the projections of hj and bj_i on 

the orthogonal complement of X)fc=i ®,bfe. Inequality (5) states that the vectors 
\ < j <l are “reasonably” orthogonal. Inequality (6) can be interpreted as fol- 
lows. Suppose that vectors bj_i and hj are interchanged. Then the vectors b*_j^ 
and b* will change as well. More precisely, the new vector b*_j^ is the vector 
b* -I- Inequality (6) therefore says that the length of vector b*_;^ 

does not decrease too much if vectors vectors bj_i and hj are interchanged. The 
constant | in inequality (6) is arbitrarily chosen and can be replaced by any 
fixed real number j < y < 1. Lenstra et al. [10] developed a polynomial time 
algorithm for obtaining a reduced basis for a lattice given an initial basis. The 
algorithm consists of a sequence of size reductions and interchanges as described 
below. For the precise algorithm we refer to [10]. 

Size reduction: If for any pair of indices j,k : 1 < k < j < I condition (5) is 
violated, then replace hj by b^ — \fijk\hk, where [/ijfcj = \p.jk — 

Interchange: If condition (6) is violated for an index j, I < j < I, then inter- 
change vectors bj_i and hj. 
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Basis reduction was introduced in integer programming by H.W. Lenstra, 
Jr. [11], who showed that the problem of determining if there exists a vector 
X G 2Z^ such that Ax < d can be solved in polynomial time when n is fixed. 
Before this result was published, only the cases n = 1,2 were known to be 
polynomially solvable. The idea behind Lenstra’s algorithm can be explained 
considering a two-dimensional convex body. Suppose that this body is “thin” as 
illustrated in Fig. 1. If it extends arbitrarily far in both directions, as indicated 
in the figure, then an LP-based branch-and-bound tree will become arbitrarily 
deep before concluding that no feasible solution exists. It is easy to construct a 
similar example in which a feasible vector does exist. So, even if n = 2, an LP- 
based branch-and-bound algorithm may require exponentially many iterations 
in terms of the dimension. What Lenstra observed was the following. Assume 







• • • • • • • • 



Fig. 1. A thin convex body in 



that we start with the full-dimensional bounded convex body X G M” and 
that we consider the lattice . The problem is to determine whether there 
exists a vector x G (A n .S"). We refer to this problem as problem P. We use 
hj = Bj, 1 < j < n as a basis for the lattice where ej is the vector where 
all elements of the vector are equal to zero, except element j that is equal to one. 
To avoid having a convex body that is thin we apply a linear transformation t 
to X to make it appear “regular” . Problem P is equivalent to the problem of 
determining whether there exists a vector x G (rAHr.^"). The new convex body 
tA has a regular shape but the basis vectors TSj are not necessarily orthogonal 
any longer, so from the point of view of branching the difficulty is still present. 
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We can view this as having shifted the problem we had from the convex body 
to the lattice. This is where basis reduction proves useful. By applying the 
algorithm to the basis vectors re^, we obtain a new basis bi , . . . , b„ spanning the 
same lattice, tTZ"^ , but having short, nearly-orthogonal vectors. In particular it 
is possible to show that the distance d between any two consecutive hyperplanes 
H + fcb„, H + {k + l)b„, where H = IRbj and k G 7Z, is not too short, 

which means that if we branch on these hyperplanes, then there cannot be too 
many of them. Each branch at a certain level of the search tree corresponds to 
a subproblem with dimension one less than the dimension of its predecessor. In 
Fig. 2 we show how the distance between hyperplanes H + khn increases if we 
use a basis with orthogonal vectors instead of a basis with long non-orthogonal 
ones. 




(a) 




(b) 



Fig. 2. (a) Non-orthogonal basis, (b) Orthogonal basis. 



Due to the special structure of our instances we do not use a transformation 
T in our algorithm. We can simply write down an initial basis for a lattice 
that contains all vectors of interest for our problem, and then apply the 
algorithm directly to this basis. Our initial basis resembles the bases used by 
several authors for finding feasible solutions to subset sum problems that arise 
in certain cryptographic systems, see further Sect. 3. 

For the integer programming problem P, Lovasz and Scarf [13] developed an 
algorithm that, as Lenstra’s algorithm, uses branching on hyperplanes. Instead 
of using a transformation r to transform the convex body and the initial basis 
vectors, and then applying a basis reduction algorithm, their algorithm produces 
a “Lovasz-Scarf-reduced” basis by measuring the width of the considered con- 
vex body in different independent directions. Lovasz and Scarf’s definition of a 
reduced basis is a generalization of the definition given by Lenstra et al. [10]. 
Cook, Rutherford, Scarf and Shallcross [2] report on a successful implementa- 
tion of the Lovasz-Scarf algorithm. Cook et al. were able to solve some integer 
programming problems arising in network design that could not be solved by 
traditional LP-based branch-and-bound. 
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Integer programming is not the only application of lattice basis reduction. A 
prominent application is factoring polynomials with rational coefficients. Lenstra 
et al. [10] developed a polynomial-time algorithm based on basis reduction for 
finding a decomposition into irreducible factors of a non-zero polynomial in one 
variable with rational coefficients. In cryptography, basis reduction has been used 
to solve subset sum problems arising in connection with certain cryptosystems, 
see for instance [5], [9], [15], [16]. A recent application in cryptography is due 
to Coppersmith [3] who uses basis reduction to find small integer solutions to 
a polynomial in a single variable modulo N, and to a polynomial in two vari- 
ables over the integers. This has applications to some RSA-based cryptographic 
schemes. In extended g.c.d. -computations, basis reduction is used by for instance 
Havas, Majewski and Matthews [8]. Here, the aim is to find a short multiplier 
vector X such that ax = ag, where ag = gcd(ai, 02 , . . . , a„). 



3 Structure of Initial and Reduced Basis 



Here we consider a lattice that contains all vectors of interest to our problem 

( 1 ): 

does there exist a vector x G such that ax = oq, 0 < x < u? 

Without loss of generality we assume that gcd(ai, 02 , ..., fln) = 1. We formulate 
an initial basis B that generates this lattice and derive structural properties of 
the reduced basis RB obtained after applying the algorithm to B. 

Let xo denote the vector (xi, . . . , xg)^. We refer to Xq as the extended x- 

vector. Here, xg is a variable that we associate with the right-hand side coefficient 
ag. Given the bounds on the variables, we tackle problem (1) by trying to find 
a short vector x satisfying ax = ag. This can be done by finding short vectors 
in the lattice Ls containing the vectors 

{Xi,...,Xn,Xg, (aiXi-\ \~ a„X„ - QgXg))'^ , ( 7 ) 

where xi, . . . ,Xn,xg £ 7Z,. In particular, we want to find integral extended x- 
vectors lying in the null-space of ag = (oi, . . . ,a„,oo), denoted by N(ao), i.e., 
vectors xg G that satisfy ax — oga;g = 0. Moreover, if possible, we want 

X to satisfy the upper and lower bound constraints, and xg to be equal to one. 
Below we will show how we can use basis reduction to find such a vector if one 
exists. 

The lattice L spanned by the basis B given by 



/ 1 0 0 ••• 0 \ 

0 1 0 ••• 0 



B = 



0 0 

\N2a1 ■ • • 



1 

.. . 0 

• • • N2a„ 



0 

Ni 

— N2ag / 
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contains the vectors 

(x, Nixo, h)'^ = (xi, . . . , Nixo, N 2 {-aoXo + aiXi H h a„x„))^, (8) 

where and N 2 are large integral numbers. Note that the basis vectors are given 
columnwise, and that the basis consists of n + 1 vectors hj = {h\j, . . . , bn+ 2 ,j)'^ , 
I <j<n+l. A vector (xo,0)^ in the lattice Lg that belongs to N(ao,0) 
corresponds to a vector (x, NiXq, 0)^ in the lattice L. We will show below that 
by choosing the multipliers iVi and N 2 large enough the reduced basis will contain 
a vector (x^, 0)^, and vectors (x^, 0, 0, )^, 1 < j < n — 1. The vector x^ is a 

solution to the equation ax = oq, and the vectors Xj are solutions to ax = 0, i.e., 
they belong to the null-space N(a). Since the vectors xj, and xj, 1 < j < n — 1 
belong to the reduced basis we can expect them to be relatively short. 

Lemma 1 (Lenstra, Lenstra, Lovasz [10]). Let A C M” be a lattice with 
reduced basis bi,b 2 ,...,b/ G M". Let yi,y 2 , ■ ■ ■ ,yt G A, t < I be linearly 
independent. Then, 

||bj||2 < 2"-imax{||yi||M|y2|p,...,||yt|p} for l<j<t. (9) 

Let hj = {bij . . . ,bn+ 2 ,j)'^ , ^ < j < n + 1, denote the vectors of the reduced 
basis, RB, obtained by applying to B. 

Theorem 1. There exists numbers Nqi O'nd fVo 2 such that if N\ > Nqi, and if 
N 2 > N\Nq 2 , then the vectors hj G of the reduced basis RB have the 

following properties: 

1. bn+i,j = 0 /or 1 < j < n - 1, 

2. bn+ 2 ,j = 0 for 1 < j < n, 
d. bnj-l^n — 2^1- 

Proof. Without loss of generality we assume that o„ yf 0. Consider the following 
n vectors in Vj = (o„ey — Oje„,0)^, 1 < j < n — 1, v„ = (ooe„,a„)^. 

Next, define vectors zi, . . . , z„ as follows: 

= Bvy = (vy,0)^, l<j<n-l, (10) 

hyi — Bv,^ — (og^n 5 O-n 5 0) . (H) 

The vectors Zj, 1 < j < n belong to the lattice L. 

Choose A^oi such that 

iVo\ > 2”+imax{||zi|p,...,||z„_i|p} = 2"+ Vax{||vi|p, . . . , ||v„_i|p}. 
From Lemma 1 we have that 

||bj|p < 2"+^max{||zi|p,...,||z„_i|p} for 1 < / < n - 1. 

Suppose that &n+i,y yf 0 for some j : 1 < J < n — 1. Then ||bj|p > > Nf 

as Ni divides b„+ij. As a consequence, ||bj|p > A^g;^, which contradicts the 
outcome of Lemma 1. We therefore have that bn+i,j = 0 for 1 < j < n — 1. 
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Next, select -/V02 such that 

Nq2 > 2”’^^max{||vi|p,...,||v„|p}. 

Due to Lemma 1, the following holds for the reduced basis vectors bj, 1 < j < n: 

||bj||2 < 2”+imax{||zi|p,...,||z„|p} = 

2"+imax{||zi|p,...,||z„_i||2,ag + fVi2Q,2| ^ 

2"+imax{i|vi|p,...,||v„_i||2,a^ + 7Vfa2} < 

2"+imax{iivii|2, . . . , jjv„_ii|2, iV2||^J|2| < 

Nl 2"+imax{||vi||2,...,||v„_i||2,||v„||2} < N^N^2- 

If bn+ 2 ,j ^ 0 for some J : 1 < J < n, then ||bj|p > o^+ 2 ,j — -^2 since N 2 
divides bn+ 2 ,j- This implies that ||bj|p > NINq 2 , which yields a contradiction. 
We can therefore conclude that bn+ 2 ,j = 0 for 1 < j < n. 

Finally, we prove Property 3. The equation ax = oq has a feasible solution 
since we assume that gcd(ai, . . . , a„) = 1 and since ag is integral. Let be 
a solution to ax = oq. This implies that the vector (xd,JVi,0)^ = B(xd,l)^ 
belongs to L spanned by B. The lattice L is also spanned by the reduced ba- 
sis RB, and hence the vector (xd,7Vi,0)^ can be obtained as (xd, JVi,0)^ = 
RB(Ai,... ,A„+i)^. Properties 1 and 2 imply the following: 

OAi -l- • • • -l- 0A„_i -l- 6„+i,„A„ -l- 6„+i^ri+iA„+i = fVi (12) 



OAi 0A„ -|- bn+2,n+l-^n+l — 0 (13) 

From equation (13) we obtain A„+i = 0 as bn+ 2 ,n+i is equal to an integer 
multiple of 1V2- If we use A„+i = 0 in equation (12) we obtain 6„+i_„A„ = fVi, 
which implies that bn+i,n divides Ni. Moreover, divides bn, n+i (cf. the proof 

of Property 1). We can therefore conclude that = Ni. □ 



Example 1. Consider the following instance of problem (1): does there exist a 
vector X e ZZ^ such that: 



8, 400, OOOxi -f 4, 000, 000x2 + 15, 688x3 + 6, 720x4 -f 15x5 = 371, 065, 262; 
0 < xi < 45; 0 < X2 < 39; 0 < X3 < 349; 0 < X4 < 199; 0 < X5 < 170? 



Let ?Vi = 1, 000 and N 2 = 10, 000. The initial basis B looks as follows: 

( 



B = 



0 ' 

0 

0 

0 

Ni 



V 8,400,000iV2 4,000,000772 15,688^2 6,720^2 15^2 -371,065,262^2/ 
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After applying to B we obtain: 

0 1 5 36 2 \ 

0 -2 -10 17 -4 

15 -25 -95 39 -42 

35 -1 -76 8 -21 

-8 -72 72 -22 1 

0 0 0 1,000 0 

0 0 0 0 10,000 

Notice that the sixth ((n + l)-st) element of the first four (n — 1) basis vectors 
of RB are equal to zero, and that the last element ((n + 2)-nd) of the first five 
(n) basis vectors are equal to zero. We also note that bn+i,n = be , 5 = Ni and 
that the elements of the first n = 5 rows of RB are quite small. □ 

We conclude the section with a brief comparison between our initial basis 
B and various bases used when trying to find solutions to subset sum problems 
that arise in cryptography. In the cryptography application the message that 
the “sender” wants to transmit to the “receiver” is represented by a sequence 
X G {0, 1}" of “bits”. The receiver knows a sequence of numbers ai, . . . , a„, and 
instead of sending the actual message x, the sender sends a number oq = ax. 
Once the receiver knows oq he can recover the message x by solving the subset 
sum problem ax = oq. Here, the equation ax = oq is known to have a solution. 
Lagarias and Odlyzko [9] considered the following basis to generate the lattice 
containing vectors (x, (—ax + ooXo))^: 

/ j(n) Q(nxl) 

B = 

V -a 0,0 

Here, denotes the n-dimensional identity matrix, and denotes the 

n X 1-matrix consisting of zeros only. Lagarias and Odlyzko proposed a polyno- 
mial time algorithm based on basis reduction. There is no guarantee that the 
algorithm finds a feasible vector x, but the authors show that for “almost all” 
instances of density d = n/ log2{maXj aj) < 0.645, a feasible vector is found. 
Several similar bases have been considered later as input to algorithms for trying 
to find solutions to subset sum problems. Schnorr and Euchner [15] for instance 
used the basis 

/ diag(2)("^") l("xi)\ 

B = na nao 

\ 1 / 

where diag(2)^”^”) is the n x n-matrix with twos along the main diagonal and 
zeros otherwise. Here, a lattice vector v G that satisfies \vn+ 2 1 = 1) Vn+i = 

0 and Vj G {±1} for 0 < j < n, corresponds to a feasible vector xj = ^\vj — 
Vn+ 2 \, 0 < J < n. Schnorr and Euchner [15] proposed an algorithm that for 
“almost all” subset sum problem instances with d < 0.9408 finds a feasible 
vector. The algorithm uses the above basis as input. For further details on finding 
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feasible solutions to subset sum problems arising in cryptography we refer to the 
above references and to the references given in the introduction. 

The important differences between the approach described above and our 
approach are the following. First the question that is posed is different. We do 
not know a priori whether our instances have a feasible solution or not, and we 
want to solve the feasibility problem, i.e., if a feasible solution exists we want 
to find one, and if no feasible solution exists this should be verified. Hence, we 
propose a branching algorithm as described in the following section. We also use 
two large constants Ni and N 2 to “force” the algorithm to find interesting 
vectors. Moreover, we have shown, see [1], that our algorithm can be generalized 
to handle systems of linear diophantine equations. 



4 The Algorithm 



Here we discuss how we can use the properties stated in Theorem 1 to design an 
algorithm for solving the feasibility problem (1). Since we are only interested in 
vectors belonging to the null-space N(ao) we consider only the first n-|-l elements 
of the first n vectors of RB. We now have obtained a basis for n N(a, -^) 

with the following structure: 





RB' 




X 

Q(lx(n- 


-1)) 


Xd \ 

NiJ 


In our example above the basis RB' 


' corresponds to 




/ 


-10 


0 


1 


5 


36 \ 






21 


0 


-2 - 


-10 


17 






0 


15 


-25 - 


-95 


39 






0 


-35 


-1 - 


-76 


8 






0 


-8 


-72 


72 


-22 




1 


0 


0 


0 


0 


1,000 / 



The last column of the matrix RB\ (xrf,W)^ is a solution to the equation 
ax — ;^a;o = 0, which implies that the vector x^ is a solution to the equation 
ax = qq. All other columns of RB' are solutions to the equation ax — 0 = 0, 
i.e., the columns of the submatrix X all lie in the null-space N(a). 

In our algorithm we first check whether the vector x^ satisfies the lower and 
upper bounds. If yes, we are done. If any of the bounds is violated we search for a 
vector that is feasible, or for a proof that no feasible vector exists, by branching 
on linear integer multiples of the vectors in N(a). Note that by adding any linear 
integer combination of vectors in N(a), x^, to x^ we obtain a vector x^-l-xv that 
satisfies ax^ -f- x^ = ao- For the feasible instances the search for a feasible vector 
turned out to be easy. To speed up the algorithm for most of these instances we 
developed a heuristic as follows. Let be the j-th column of the submatrix X of 
RB'. Suppose that we are at iteration t of the heuristic and that an integer linear 
combination of to < f vectors of N(a) has been added to vector x^. The vector 
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obtained in this way is called the “current vector”. For simplicity we assume that 
only variable Xk of the current vector violates one of its bound constraints. At 
iteration t we add or subtract an integer multiple At of the column vector Xt if 
the violation of variable Xk’s bound constraint is reduced and if no other bound 
constraints becomes violated. As soon as the value of Xk satisfies its bounds, 
we do not consider any larger values of At. If the heuristic does not find any 
feasible solution, we call an exact branching algorithm that branches on linear 
combinations of vectors in N(a). A summary of the complete algorithm is given 
in Fig. 3. 



procedure main(a, ao, u) 

begin 

store initial basis B; 

RB = L3(B); 

extract RB' from RB; 

if 0 < Xd < u then return x^; 

heuristic(RB); 

if heuristic fails then 

branch on linear combinations of columns j = 1, . . . , n — 1 of the submatrix X; 
return feasible vector x, or a proof that no such vector exists; 

end 



Fig. 3. Algorithm 1. 



In our example we can see that by subtracting the third column of X, 
X3 = (1, —2, -25,-1, —72)^, from = (36, 17, 39, 8, —22)^, we obtain a vector 
x = (35, 19, 64, 9, 50)^ that satisfies the equation as well as all the bounds. 

5 Computational Experience 

We solved thirteen instances of problem (1). Eight of the instances were feasible 
and five infeasible. The instances starting with “P” in Table 1 were obtained 
from Philips Research Labs. The instances starting with “F” are the Frobenius 
instances of Cornuejols et al. [4]. Here we used the Frobenius number as right- 
hand side oq. The two other instances, starting with “E”, were derived from 
F3 and F4. The information in Table 1 is interpreted as follows. In the first 
two columns, “Instance” and “n”, the instance names and the dimension of 
the instances are given. An “F” in column “Type” means that the instance is 
feasible, and an “N” that it is not feasible. In the two columns of LP-based 
branch-and-bound, “LP B&B”, the number of nodes and the computing time 
are given. In the Nodes” column, 500, 000* means that we terminated the 
search after 500,000 nodes without reaching a result. Two asterisks after the 
number of nodes indicate that a rounding error occurred, i.e., that the rounded 
solution given by the algorithm did not satisfy the diophantine equation. In both 
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Table 1. Results of the computational experiments. 









LP B&B 




Algorithm 


Instance 


n 


Type 


# Nodes 


Time (s) 


Heur. 


# Nodes 


Time (s) 


PI 


5 


F 


420“ 


_ 


1 




< 10"® 


P2 


5 


F 


327 


0.09 


1 




< 10"® 


P3 


4 


F 


75 


0.05 


0 




< 10"® 


P4 


5 


F 


313“ 


- 


1 




< 10"® 


P5 


5 


F 


231 


0.11 


2 




< 10"® 


P6 


5 


F 


313“ 


- 


1 




< 10"® 


El 


6 


F 


3,271 


0.97 


0 




< 10"® 


E2 


7 


F 


500, 000* 


- 


0 




< 10"® 


FI 


5 


N 


500, 000* 


- 


- 


1 


< 10"® 


F2 


6 


N 


500, 000* 


- 


- 


5 


0.01 


F3 


6 


N 


500, 000* 


- 


- 


1 


< 10"® 


F4 


7 


N 


500, 000* 


- 


- 


1 


0.01 


F5 


8 


N 


500, 000* 


- 


- 


5 


0.01 



cases we do not report on the computing times since no result was obtained. In 
the three columns corresponding to our algorithm, “Algorithm”, the column 
“Heur.” gives the number of vectors belonging to N(a) that was used in the 
integer linear combination of vectors added to the vector by the heuristic in 
order to obtain a feasible solution. Notice that for every feasible instance, the 
heuristic found a feasible solution. A zero in column “Heur.” therefore means that 
the vector was feasible. For the infeasible instances the heuristic obviously 
failed, and therefore the sign is given in the column. In that case we turn to 
the branching phase. Here, a one in the column Nodes” means that we solved 
the problem in the root node by using logical implications. The computing times 
are given in seconds on a 144MHz Sun Ultra-1. For the LP-based branch-and- 
bound we used CPLEX version 4.0.9 [6], and in our algorithm we used LiDIA, a 
library for computational number theory [12], for computing the reduced basis. 

Our results indicate that the instances are rather trivial once they are rep- 
resented in a good way. Using the basis Sj and branching on variables as in 
LP-based branch-and-bound is clearly not a good approach here, but it is the 
standard way of tackling integer programs. Using basis reduction seems to give 
a more natural representation of the problem. For our instances the computing 
times were very short, and, contrary to LP-based branch-and-bound, we avoid 
round-off errors. It is also worth noticing that the infeasibility of instances F1-F5 
was particularly quickly verified using our algorithm. 
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Abstract. This paper introduces a scheme of deriving strong cutting 
planes for a general integer programming problem. The scheme is related 
to Chvatal-Gomory cutting planes and important special cases such as 
odd hole and clique inequalities for the stable set polyhedron or families 
of inequalities for the knapsack polyhedron. We analyze how relations 
between covering and incomparability numbers associated with the ma- 
trix can be used to bound coefficients in these inequalities. For the in- 
tersection of several knapsack polyhedra, incomparabilities between the 
column vectors of the associated matrix will be shown to transfer into 
inequalities of the associated polyhedron. Our scheme has been incorpo- 
rated into the mixed integer programming code SIP. About experimental 
results will be reported. 



1 Introduction 

LP-based branch-and-bound algorithms are currently the most important tool 
to deal with real-world general mixed integer programming problems compu- 
tationally. Usually, the LP relaxations that occur during the execution of such 
methods are strengthened by cutting planes. Cutting planes for integer programs 
may be classified with regard to the question whether their derivation requires 
knowledge about the structure of the underlying constraint matrix. Examples 
of families of cutting planes that do not exploit the structure of the constraint 
matrix are Chvatal-Gomory cuts [6], [4], [12] or lift-and-project cuts [1]. An 
alternative approach to obtain cutting planes for an integer program follows es- 
sentially the scheme to derive relaxations associated with certain substructures 
of the underlying constraint matrix, and tries to find valid inequalities for these 
relaxations. Crowder, Johnson and Padberg [5] applied this methodology by in- 
terpreting each single row of the constraint matrix as a knapsack relaxation and 
strengthened the integer program by adding violated knapsack inequalities. An 
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analysis of other important relaxations of an integer program allows to incor- 
porate odd hole and clique inequalities for the stable set polyhedron [8] or flow 
cover inequalities for certain mixed integer models [11]. Further recent examples 
of this second approach are given in [3] , [7] . 

Our paper follows the methodology to obtain cutting planes for an integer 
program by investigating relaxations of it. We try to go one step further and 
investigate the intersection of two or more knapsack polyhedra. We describe a 
general family of valid inequalities for an integer program that are associated 
with its feasible solutions. Usually such inequalities must be lifted in order to 
induce high dimensional faces. We derive lower and upper bounds on the exact 
lifting coefficients of such an inequality and discuss special cases when these 
bounds can be computed in polynomial time. We also relate our family of in- 
equalities to Chvatal-Gomory cuts and discuss in detail the special case where 
only two knapsacks are involved. The use of feasible set inequalities within an 
implementation for the solution of general mixed integer programming problems 
is investigated. 

Consider some matrix A G IR’”^", vectors b G IR*”, u G IR", and the poly tope 

P := conv {x G : Ax < 6, 0 < a; < u}, 

that is the convex hull of all integral vectors x satisfying Ax < b and 0 < x < u. 
Set N := n}. For S C N, define Ps := P O {x G IR" : Xi = 0 for i G 

N \ S'}, where Pn = P, and, for x G IR", denote by x\s ■= {xi)i^s the vector 
restricted to the components in S. 

Definition 1. Let T C N such that ^ ^ ^ with v < u\t- 

T is called a feasible set. Let w \ T ^ ZZ be some weighting of the elements of 
T. For j G N\T with XieT + A.jUj ^ b , the inequality 

'^WiXi+WjXj <'^WiUi (1) 

zGT i^T 

is called a feasible set inequality associated with T (and {j}) if 
• ^ 

Wj < mm - mm > WiXi 

1^1,. ..,Ui I X 

^ iGT 

A.iXi > A.j - r{T) ( 2 ) 

i&T 

0 < Xi < Ui,Xi G TZi, i gT, 

where r{T) := b - 



Theorem 2. Feasible set inequalities are valid for Tru{i}- 



Proof Let 7 = Ti := niax {XjgT : Yiex ^ 

b, 0 < Xi < Ui,Xi G 2Z^ i GT}. After complementing variables Xi to Ui — Xi for 
i G T we obtain that the right hand side of (2) is min;=i_,,,_„^ j(7~7/)- For some 
feasible solution x G P we have Yier < 7$„ + xj min;=i_,,,_„^ j(7 ~ 



7/) <7$,- +(7-7£j) = 7- 



□ 
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Examples of feasible set inequalities include (1, fc)-configuration and mini- 
mal-cover inequalities that are known for the knapsack polytope K := conv {x G 
{0,1}" : a^x < a} with a G IR",a > 0. Let S' C fV be a minimal cover, i.e., 
a(S) > a and a(S \ (zj) < a for all z G S, and partition S into T and {j} 
for some j G S. Set Wi := 1 for all i G T. The feasible set inequality reads 
^ 1^1 = l'5'l - 1 with Wj < min{|E| : V C - 

ttj — r(T)}. Since + ^C^) = cn and Xzes minimum is greater 

than or equal to one. Therefore, under the regularity condition imposed in [10], 
the feasible set inequality is always a (1, /c)-configuration inequality. In case the 
coefficient happens to be one we get a minimal cover inequality, see, for instance, 
[13]. 

Theorem 2 states the validity of the feasible set inequality for Tru{i}- To 
obtain a (strong) valid inequality for P we resort to lifting, see [9]. Consider 
some permutation 

TTi, . . . , ■Kn-\T\-i of the set IV \ (T U {j}). For k = 1, . . . , rz — |Tj — 1 and 
1 = 1 ,.. . ,zz,Tfc let 

7 (fc, 1) = max WiXi + WiXi 

ieTU{j} iG{7ri,...,7rfc_i} 

^ ' A-iXi 'y ] A.iXi -\- < h (3) 

ieTU{j} iG{iri,...,ir)c-i} 

0 < Xi < Ui,Xi G Z for z G T U {tti, . . . , 7Tfe_i}. 

Let 7 = the lifting coefficients are 



Wtt,, ■= min 



I 



(4) 



The following statement is immediate. 

Theorem 3. The (lifted) feasible set inequality vlAx < Xzer for 

P. 



Note that the right hand side of (2) coincides with (4) applied to variable j 
if we substitute in (3) the set T U {j} by T. In other words, a lifted feasible set 
inequality associated with T and {j}, where the variables in IV \ (T U {j}) are 
lifted according to the sequence tti, . . . ,7t„_|t|-i, coincides with the inequality 
associated with T, where j is lifted first, and the remaining variables N\{TU{j}) 
are lifted in the same order tti, . . . ,'Kn-\T\-i- Thus, instead of speaking of a 
feasible set inequality associated with T and {j}, we speak in the sequel of a 
feasible set inequality associated with T and view j as the variable that is lifted 
first. 

Odd hole- and clique inequalities for the set packing polytope are examples of 
lifted feasible set inequalities. Consider the set packing polytope P = convjx G 
{0, 1}" : Ax < 1} for some 0/1 matrix A G (0, Let Ga = (V,E) denote 
the associated column intersection graph whose nodes correspond to the columns 
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of A and nodes i and j are adjacent if and only if the columns associated with 
i and j intersect in some row. Let Q C V be a clique in Ga, then the clique 
inequality < 1 is valid for P. To see that this inequality is a lifted 

feasible set inequality, let T = {i} for some i G Q. The feasible set inequality 
cci < 1 is valid for P{i}- Lifting the remaining variables k € Q \ {z} by applying 
formula (4) yields tCfc = 1, and the clique inequality follows. 



2 Bounds on the Lifting Coefficients 

For a feasible set inequality associated with T, the calculation of the lifting 
coefficients for the variables in fV \ T requires the solution of an integer program. 
In this section we study lower and upper bounds for these coefficients. It will turn 
out that these bounds are sometimes easier to compute. We assume throughout 
the section that A > 0 and Wi > 0 for i € T. 

Definition 4. Let T C N be a feasible set and w \ T ^ IR^^ a weighting ofT. 
For V G IR™ we define the 

Covering Number 

4>- (v) := min WiXi : A.iXi > v, 0 < Xi < Ui, Xi G z S T}, 

ieT iGT 

>-Incomparability Number 

fA (v) := min WiXi : A.iXi ^ v, 0 < Xi < Ui, Xi G i GT, 

ieT ieT 

A3JgT, Xj < Uj : A.iXi + A.j > z;}, 

ieT 

<-Incomparability Number 

4>^ (v) := min {E WiXi : E A.iXi ^ V, 0 < Xi < Ui,Xi G TZi^ z S T}, 

ieT iGT 

where we set (jA{v) := 0 and := 0 for u < 0. 

Consider a (lifted) feasible set inequality w'^x < X^ieT associated with 
T, where the variables in N \ T are lifted in the sequence tti, . . . ,7t„_|j’|. The 
following proposition gives upper bounds for the lifting coefficients derived from 
the covering number. 

Proposition 5. 

(a) zc,ri = min/=i^..._„^^ \ (j>^{A.^J - r(T)). 

(b) } (j>^{A..^J - r(T)), for k = 2, . . . ,n - \T\. 
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Proof, (a) directly follows from Theorem 2. To see (b), it suffices to show that 
7 — 7(/c, 1) < (j)-{A.Tr^l — r{T)) for I = 1 , see (4). This relation is obtained 
by 

7 - l) = j- max J2ieT w^x^ + w^x^ 

'^i^T ^-iXi + ^-iXi + < 7, 

0 < a; < G 

min X;i6T W^X^ - w^x^ 

'^ieT ^-iXi ~ X)ie{7ri,...,irfc_i} ^-iXi ^ ~ x{T), 

0<x< G zTu{.^,...,.k-^} 

< min X^ier 

0 < X < u\t, X G 
= cf^{A.^J-r{T)), 

where the second equation follows by complementing variables Xi, i & T. □ 

To derive lower bounds on the lifting coefficients we need the following rela- 
tions. 

Lemma 6. For vi,V 2 G with ui,U2 > 0 holds: 

(a) > (j)^{vi) and (j>-{vi) > (j)^{vi). 

(b) 4>-,(jA, and 4>^ are monotonically increasing, that is for v\ > V 2 , 4>-{vi) > 
4 >-{v 2), 4>^{vi) > 4 >^{v 2), and 4>^{vi) > 4>^{v2). 

(c) 4>-{vi + V 2 ) > 4>^{vi) + 4 >^{v2). 

(d) 4>-{vi + V 2 ) + maxjwj : t G T} > 4)^{vi) + 4>-{v2). 

(e) + V 2 ) + maxjwj : t G T} > + <t)^{v 2 ). 

Proof, (a) and (b) are obvious, the proofs of (c) and (e) follow the same line. 
We show exemplarily (c). Let x G IR^ with 0 < a; < u\t and XieT — 
ui -I- V 2 such that '^^^rpWiXi = (j>-{vi + V 2 ). If ui = U2 = 0 the statement 
is trivial. Otherwise, suppose w.l.o.g. v\ > 0. Let z G with z < x such 
that '^i^T '^i(zT'WiZi is minimal. Since ui > 0 there exists 

some io € T with Zi„ > 0, and since z was chosen to be minimal, we have that 
^-io ^1- This implies that Xigt ^ t>2. Summing 

up, we get 4i-{vi + V 2 ) = ~ “ 

Zi) + A.i,j) > (j)^{vi) + (j)^{v 2 ). Finally, (d) directly follows from (c) and the fact 
that -I- max{wi : t G T} > ^-(wi). □ 

With the help of Lemma 6 we are able to bound the lifting coefficients from 
below. 

Theorem 7. For A: = 1, . . . , n — |T| we have 
^ i^-Tv J-r{T)) 

WTTk > min maxjwi : i G J |. (5) 
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n r T 2 ^ — r(T)) ^ , ,1 

Frooj. Let c^j, := 1 maxjwi : z € i | denote the 

right hand side of (5), for A: = 1, . . . , n — |T|. We show by induction on k that 
the inequality J2iex + X)i=l ^TTiXi < X)i6T WiUi is valid. For A: = 1, the 
statement follows from Proposition 5 (a) and Lemma 6 (a). Now let A: > 2 and 
suppose the statement is true for alH < k. Let x be an optimal solution of 

k 

max WiXi + 

iGT 2=1 

k 

2GT 2=1 

0 < Xi < Ui, Xi € ^ for i € T U {tti, . . . , tt^}. 

We must show that ^-rriX-Ki < XiGT First note that the 

inequality is valid if Xtt^ = 0. This is equivalent to saying 

k-l k-1 

^ ^ -A-ti-^Xti-^ r(T)^ ^ ^ ^ , 

2=1 2=1 

for all X G with A-Tr^x-TTi < b, 0 < x-j^. < i = 1. 

Applying Lemma 6 (d) and (b) we obtain with Wmax ■= max{u;i : i GT} 



k k—1 

4>- ~^(0)) + '/'^(^•Trfc^TrJ - Wmax 
2=1 2=1 

k-l 

^ ^ ^ Ctj-^Xtj-^ T Xtt-^ r(X")) U)max 

2=1 



k-l 

2=1 



+ Xtt. min 

/ = !,.. ., 22 fc 



I 



r(T)) 



'^max 



k 

^ ^ Ctj-^Xtj-^ . 
2=1 



On account of 
follows. 



Xi) > 4'~Gl\=i^-^iXTTi — f{T)) the statement 

□ 



Theorem 7 applies, in particular, if we set the coefficient of the first lifted 
variable to the upper bound of Proposition 5 (a). The subsequent example 
shows that in this case the lower bounds given in Theorem 7 may be tight. 



Example 8. Let A = 



11668613 



and b = 



14 

12 



. The set T = {1,2, 3, 4} is 



46113921 

feasible for the 0/1 program maxjc^a: : x G P} with P := convja: G (0, 1}® : 
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Ax < b}. For Wi = I, i £ T, we obtain <f)- = 3. Moreover, (j)^ = 2, because 

(g) > A.i for i £ T. Accordingly we get 4>^{2) = 1 = The inequality 

xi + X 2 + xz + xa + (p- (g)a;5 + i^-i) ~ < 4 reads x\ + X 2 + xz + 

X 4 + 3x5 + xq <4. It defines a facet of P. 

The question remains, whether the values <j)-,<pA and (p^ are easier to com- 
pute than the exact lifting coefficient. Indeed, they sometimes are. Suppose 
Wi = \ for all z G T and consider the comparability digraph G = (V, E) that 
is obtained by introducing a node for each column and arcs (z, j) if A.i > A.j 
and A.i yf A.j or if A.i = A.j and z > j, for i,j £ {1, . . . ,rz} (where transitive 
arcs may be deleted). Let r denote the number of nodes with indegree zero, i. e., 
6~ (i) = 0. Then, (p-,(p^ and (p^ can be computed in time 0{’tf + a), where a 
is the time to construct the comparability digraph. For example, in case of one 
knapsack inequality the comparability digraph turns out to be a path, and thus 
(p- , and p^ can be computed in time 0{n + zzlogzz) = O(zzlogzz). 



3 Connection to Chvatal-Gomory Cnts 



So far we have been discussing feasible set inequalities for general integer pro- 
grams. With the exception that with u\t also every vector v < u\t is valid for 
P we have not subsumed any assumptions on the matrix A. Thus a comparison 
to Chvatal-Gomory cutting planes that do not rely on any particular struc- 
ture of A is natural. Recall that Chvatal-Gomory inequalities for the system 
Ax <b, 0<x<u, x£ 7Z^ are cutting planes <PF x < b such that di = , 



z = 1, . . . , zz, and 

6 = for some A € IR™'*'", where A = 



'A' 




'h 


I 


and b = 


u 



Consider a (lifted) feasible set inequality w'^x < ’’XiUi associated with 

T, whose remaining variables N \ T are lifted in the sequence tti, . . . ,7t„_|t|. 
This lifted feasible set inequality is compared to Chvatal-Gomory inequalities 
resulting from multipliers A G IR™'*'" that satisfy [A^A.^J = Wi for i £ T. 

Proposition 9. 

(a) [A^6J > 

(b) If LA^Sj = XzeT UiWi, let j be the smallest index with A 

Then, [A^A..„.J 



Proof. Since T is a feasible set, (a) is obviously true. To see (b) suppose the 
contrary and let j be the first index with \}A A. A A and A. A > 
Set 7 = UiWi and consider an optimal solution x G 

of (3) such that Obviously, x can be extended to a feasible 

solution X of P by setting Xj = Xj, if z G T U {tti, . . . ,TTj}, Xi = 0, otherwise. 
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This solution satisfies the feasible set inequality with equality, since uFx = 
X]»eTu{7u.....77,_i} WiX^+WT,^Xj = 7(j, + = 7- On the other hand, j 

is the first index where the Chvatal-Gomory and the feasible set coefficient differ 
and we conclude Eieiv > 

Si6Tu{-n-i ij = 7 = [A^&J , Contradicting the validity of the 

Chvatal-Gomory inequality. □ 

As soon as the first two coefficients differ, for k G . . . , 7r„_|7’|}, no 

further statements on the relations of the coefficients are possible, in general. 

Example 10. For 6 G IN, let P{b) be the convex hull of all 0/1 solutions that 
satisfy the knapsack inequality 

2xi -I- 6x2 + 8x3 -I- 9x4 + 9x5 + 21x6 + 4x7 < b. 

One Chvatal-Gomory cutting plane for P{b) reads 

X1 -I- 3x2 + 4x3 + 4x4 + 4x5 + 10x6 + 2x7 < L^/2J ■ 

Let b = 25. The set T := {1, 2, 3, 4} is a feasible set. Choosing coefficients 1, 3, 4, 4 
for the items in T and lifting the items 5, 6, 7 in this order we obtain the lifted 
feasible set inequality that is valid for P(25): 

xi -I- 3x2 + 4x3 + 4x4 + 4x5 + lla^e + x-j < 12. 

The right hand side of the Chvatal-Gomory cutting plane and the lifted feasible 
set inequality coincide. With respect to the lifting order 5,6,7 the coefficient 
of item 6 is the first one in which the two inequalities differ. This coefficient is 
11 in the feasible set inequality and 10 in the Chvatal-Gomory cutting plane. 
For item 7 the coefficient in the feasible set inequality is then smaller than the 
corresponding one in the Chvatal-Gomory cutting plane. For 6 = 27 we obtain 
the lifted feasible set inequality that is valid for P{27): 

Xi -I- 3x2 + 4x3 + 4x4 + 4x5 + 9x6 + X-J < 12. 

The right hand side of this inequality is by one smaller than the right hand side 
of the corresponding Chvatal-Gomory cutting plane for A = ^. However, the 
coefficients of the items 6 and 7 are smaller than the corresponding coefficients 
of the Chvatal-Gomory cutting plane. 

Under certain conditions a feasible set- and a Chvatal-Gomory cutting plane 
coincide. 

Theorem 11. Let A G ]N™^” and T he a feasible set of the integer program 
maxjc^x : Ax <6, 0<x<ll, xG Let A G such that [A^6J = 

[A^A.iJ . Lf, for all j G N \ T , column vector A.j is a 0 /I- combination of 
elements from {A.i : i G T}, then 

[\^A.,\=f>>-{A.,-r{T)). 
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Proof. We first show that A. j\ > <j>-{A.j — r{T)). Since A.j is a 0/ 1-com- 
bination of elements from {A.i : i G T}, there exist a G {0,1}^ such that 
= X^.j. Thus, = \_X^ \ — = 

Y.^eT<^iW^ > -r{T)). 

To show the opposite relation, we know by Proposition 5 that the coefficient of 
the feasible set inequality wj satisfies wj < (j>-{A.j — r{T)). By Proposition 9 (b), 
however, we know that, if wj does not coincide with [X^ A.j\ for all j G N \ T, 
there is at least one index j with wj > [X^ A.j\ . This together with the first 
part of the proof implies (p-{A.j — r(T)) < [X^ A.j\ < Wj < (f-{A.j — r{T)), a 
contradiction. Thus, wj = [X^ A.j\ for all j, and the claim follows by Proposition 
5. □ 

By Proposition 5 the expression (j)-(A.j —r(T)) is an upper bound on the ex- 
act coefficient of an item j G N\T in any lifted feasible set inequality associated 
with the feasible set T and the weighting t G T. On the other hand, the 

Chvatal-Gomory cutting plane [X^ A.j\ < [X^b\ is valid for P. Therefore, 

this Chvatal-Gomory cutting plane must coincide with any lifted feasible set in- 
equality -I- independent on the sequence 

in which the lifting coefficients Wj for the items in N \ T are computed. 

4 Consecutively Intersecting Knapsacks 

So far we have been discussing a framework with which one can define and ex- 
plain families of cutting planes for a general integer program. On the other hand, 
from a practical point of view the cutting plane phase of usual codes for integer 
programming relies in particular on valid inequalities for knapsack polyhedra. 
From both a theoretical and a practical point of view it would be desirable to 
understand under what conditions facets of single knapsack polyhedra define or 
do not define strong cutting planes of an integer program when several knapsack 
constraints intersect. This question is addressed in this section. In fact, here we 
study a special family of 0/1 programs that 

arises when A G and Ax < b defines a system of consecutively inter- 

secting knapsack constraints. Throughout this section we assume u = 1, A > 0 
and integral. For i = 1, . . . , m, let := supp(4i.) and P® := conv {x G {0, 1}'^* : 

Definition 12. A system of linear inequalities Ax < b is ealled a system of 
consecutively intersecting knapsack constraints if A G and G fV; = 0 

for all i,l G {!,..., m}, \i — l\ >2. 

A natural question when one starts investigating the intersection of several 
knapsack polyhedra is when this polyhedron inherits all the facets of the single 
knapsack polyhedra. 

Proposition 13. Let A G and Ax < b be a system of eonsecutively 

intersecting knapsack constraints. Let i G {!,..., m}. Lf Nid NiU {fc} is not a 
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cover for all I G {i — l,i+ 1} and k ^ Ni, then every facet- defining inequality of 
the knapsack polyhedron P* defines a facet of P. 

Proof. Suppose that CjXj < 7 defines a facet of P*. Let — 7 

define a facet P of P such that 

Fc ■= {x G P : CjXj = 7} C P. 

j&Ni 

Let x° G n Pc be a vector such that = 0 for all j ^ Ni. 

Consider some j ^ Ni, and let j G Ni for some I ^ i. If \l — i\ > 2, obviously 
x° + €j G Pc- In the other case, we know NiflNiU {j} is not a cover (on account 
of the conditions in the proposition), and thus x° + ej € Pc as well. This implies 
that dj = 0. Because XieAf, — 7 defines a facet of P* we obtain 

dim(Pc) > dim(P®) — 1 + |A^| — \Ni\ = |A^| — 1 > dim(P) — 1. 

Therefore, Pc defines a facet of P that coincides with P. □ 



The condition that, for every k ^ Ni, I = i — l,i 1, the set {Ni n Ni) U {k} 
is not a cover, is essential for the correctness of the proposition as the following 
example shows. 



Example I 4 . For 6 € IN \ {0} let A be the matrix 



1 + 1 1 + 1 1 + 1 0 



0 



1+1 1+1 b 



and consider P = conv {x G {0, 1}^ : Ax < b}. Then fVi n fV 2 = {2, 3}, and the 
set {2,3,4} defines a cover. 

The inequality x\ X 2 x^ <2 defines a facet of the knapsack polyhedron 



convjx G {0, 1}^ : (- + l)a;i + (7 + l)a:2 + (7 + l)a;3 < b}. 



On the other hand, the inequality + a ;2 + 2:3 < 2 is not facet-defining for P, 
since the face P = (x G P : + 0:2 + 2:3 < 2} is strictly contained in the face 

induced by the inequality xi + X 2 + X 3 + X 4 < 2 . 



In certain cases, a complete description of P may even be derived from the 
description of the single knapsack polyhedra. 

Theorem 15. Let m = 2 and A G Let Ax < b be a system of con- 

secutively intersecting knapsack constraints such that every pair of items from 
Ni D N 2 is a cover. For i = 1,2 let (7*x < be a system of inequalities that 
describes the single knapsack polyhedron PL 

Then, P is described by the system of inequalities 

'l2j&NinN2 ^ 

< 7 * for i = 1,2. 
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Proof. Note that the system of linear inequalities given in the theorem is valid for 
P. To see that it suffices to describe P, let (Fx < 7 be a non-trivial facet-defining 
inequality of P that is not a positive multiple of the inequality X)j6ArinAf2 
W.l.o.g. we assume that supp(c) n yf 0. Let Z = {zi, . . . , Zt} = Ni D N2- We 
claim that (supp(c) \ Z) n A^2 = 0- Suppose the contrary. We define natural 
numbers 7°, 7^, . . . , 7* by 

7° := max{ ^ CjXj : ^ ^ij^j < ^i) ^ £ {0, 

j&Ni\Z j&Ni\Z 

7* := max{ X) < h ~ x £ {0, 

jGNi\Z j&Ni\Z 

and claim that the face induced by the inequality c^x < 7 is contained in the 
face induced by the inequality 

i 

jewAz i=i 

This inequality is valid for P by definition of the values 7*, i = and 

because — 1- 

Let X £ P n ZZ^ such that x = 7. If ~ ^ ~ since 

otherwise we obtain a contradiction to the validity of (F x < 7. If 'Yhi^z ^ 
then — 1- Let Zi £ Z with Xzi = 1- Then c^x = 7 implies that x — e^' 

is an optimal solution of the program 

max{ E w £ {0, 1}^AZ . < 5^ _ AizJ. 

jGNi\Z 

This shows that in this case x satisfies inequality (6) as an equation, too. We 
obtain that < 7 must be a facet of the knapsack polyhedron P^. This 
completes the proof. □ 

The correctness of the theorem strongly relies on the fact that every pair of 
items from the intersection Ni n N2 is a cover. If this condition is not satisfied, 
then Example 14 demonstrates that the facet-defining inequalities of the two 
single knapsack polyhedra do not suffice in general to describe the polyhedron 
associated with the intersection of the two knapsack constraints. Geometrically, 
the fact that we intersect two knapsack constraints generates incomparabilities 
between the column vectors of the associated matrix. These incomparabilities 
give rise to cutting planes that do not define facets of one of the two single 
knapsack polyhedra that we intersect. In fact, incomparabilities between col- 
umn vectors in a matrix make it possible to “melt” inequalities from different 
knapsack polyhedra. A basic situation to which the operation of melting applies 
is 

Proposition 16 . Let m = 2 , A £ IN^^" and Ax < b be a system of two eonsecu- 
tively intersecting knapsack constraints. Let X)i6ArAAf2 X)i6ArinJV2 ^ 
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he a valid inequality for , and let '^iGNinN 2 ^ he a 

valid inequality for . Setting 0 := X)i6Ari\AT2 the melted inequality 

CtXi + ^ CiXi + ^ (ci - 0)~'~Xi < 7 

iGNi\N 2 iGNinN2 iGN2\Ni 

is valid for P. 

Proof. Let x G P H 2Z^ . If = 0 for all i ^ Ni\N\ with a — & > 0, the 
inequality is satisfied because J 2 ieNi\N 2 + J 2 teNinN 2 < 7 is valid for 



P^. Otherwise, 


GN2\Nii'^i < ^i^N2\Ni 


and we obtain 


ciXj 


-b ^ CiXi 


+ X! “ O^Xi 




ieNi\N2 


iGNinN2 


ieN2\Ni 






< ^ CjXi 


— 0 -b ^ ' CiXi -b 


CjXi 




ieNi\N2 


i£Nr<~\N2 


ieN2\Ni 




< ^ CiXi 


+ ^ CiXi 






iGNinN2 


ieN2\Ni 





< 7- 



□ 



Proposition 16 can be extended to general upper bounds u G IM”. Often the 
inequality that results from melting valid inequalities from knapsack polyhedra 
as described in Proposition 16 does not define a facet of P. In such situations 
there is a good chance of strengthening the melted inequality by determining 
lifting coefficients for the items in (A^i \ N 2 ) U {N 2 \Ni) with respect to a given 
order. 



Example 17. Let A be the matrix 



1 45 6 0 0 
0 0 5 6 1 4 



and b = 



15 

15 



. The inequality 



xi + 4x2 + 5x3 + 6x4 < 15 defines a facet of P^ := conv{x G : xi + 
4x2 + 5x3 + 6x4 < 15, 0 < Xi < 3, i G {1,2, 3, 4}}. For a G {0, 1, 2, 3} the 
inequality 5x3 + 6x4 + axQ < 15 is valid for := convjx G ^13,4,5,6} . 
5x3 + 6x4 + X5 + 4x6 < 15, 0 < Xi < 3, z G {3,4, 5}, 0 < xq < 1}. Setting 
a = 1 and applying Proposition 16 and the succeeding remark we obtain that 
Oxi + 3x2 + 5x3 + 6x4 + X6 < 15 is valid for P := conv {x G : Ax < b, 0 < 
Xi < 3, z G {1, 2, 3, 4, 5}, 0 < xe < 1}. This inequality can be strengthened by 
lifting to yield the facet-defining inequality xi -I- 3x2 + 5x3 + 6x4 -I- xe < 15. For 
a = 2, we end up with the melted inequality Oxi -I- 2x2 -I- 5x3 -I- 6x4 -1-2x6 < 15. For 
a = 3 we obtain the inequality Oxi -I- X2 -I- 5x3 + 6x4 -I- 3x6 <15 that can again be 
strengthened to yield a facet-defining inequality of P, X2 -I- 5x3 -I- 6x4 -I- X5 -I- 3x6 < 



15. 



It turns out that sometimes the feasible set inequalities for two consecu- 
tively intersecting knapsacks can be interpreted in terms of melting feasible set 
inequalities for the associated single knapsack polytopes. 
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Example 18. Consider A 



15 3 5 13 0 0 
0 17 18 17 19 20 



and b = 



20 

35 



. The set T = 



{2, 3} is feasible, and the inequality X 2 + + X 4 + 2 x^ + 2 xq < 2 is a feasible 

set inequality for := conv {x G {0, 1}{2.3,4,5,6} . _l_ _l_ \'jx 4 + 19x5 + 

20x6 < 35}, where the variables not in T are lifted in the sequence 4, 5, 6. In 
addition, xi + X2 + X3 + X4 < 2 is a feasible set inequality for := conv {x G 
{0, ljC’2,3,4} . + 3x2 + 5x3 + 13x4 < 20}, with respect to the same feasible 

set T and the lifting sequence 4, 1. Now 0 = = 1 and the melted 

inequality reads xi + X2 + X3 + X4 + X5 + xe < 2, which is facet-defining for 
P := {x G {0, 1}® : Ax < b}. Note that the melted inequality is also a feasible 
set inequality with respect to T and the lifting sequence 4, 1, 5, 6. 



Note that Example 18 also shows that under certain conditions the operation 
of melting feasible set inequalities produces a facet-defining inequality for P. 



5 Computational Experience 

In this section we briefly report on our computational experience with a sep- 
aration algorithm for the feasible set inequalities. We have incorporated this 
algorithm in a general mixed integer programming solver, called SIP, and tested 
it on instances from the MIPLIB 3.0 ([2]). Details of our separation algorithm 
such as how to determine a feasible set T, how to weight the variables in P, how 
to perform the lifting, and for which substructures of the underlying constraint 
matrix the separation algorithm should be called, will be given in a forthcoming 
paper. We compared SIP with and without using feasible set inequalities. The 
time limit for our runs was 3600 CPU seconds on a Sun Enterprise 3000. It turns 
out that for 14 out of 59 problems {air05, fiber, gesa2, gesa2.o, gesaS.o, niiscOS, 
misc07, p0033, p0201, p2756, qnetl, Seymour, stein27, steindS) we find feasible 
set inequalities or our separation routines uses more than 1% of the total running 
time. 



Table 1. Comparison of SIP with and without feasible set (FS) inequalities. 



Example 


B&B 

Nodes 


Cuts 

Others FS 


FS 


Time 

Total 


Gap % 


SIP without FS 
SIP with FS 


733843 

690812 


1241 0 

1450 1712 


0.0 

561.8 


18413.6 

18766.2 


17.38 

9.92 



Table 1 summarizes our results over these 14 problem instances. The first 
column gives the name of the problem. Column 2 the number of branch-and- 
bound nodes. The following two columns headed Cuts give the number of cutting 
planes found. Others are those that are found by the default separation routines 
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in SIP, and FS shows the number of feasible set inequalities added. Columns 
5 and 6 present the time spent for separating feasible set inequalities and the 
total time. The last column gives the sum of the gaps ( ) 
in percentage between the lower and upper bounds. Table 1 shows that the 
time slightly increases (by 2%), but the quality of the solutions is significantly 
improved, the gaps decrease by around 43% when adding feasible set inequalities. 
Based on these results we conclude that feasible set inequalities are a tool that 
helps solving mixed integer programs. 
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Abstract. The bin packing problem is one of the classical NP-hard op- 
timization problems. Even though there are many excellent theoretical 
results, including polynomial approximation schemes, there is still a lack 
of methods that are able to solve practical instances optimally. In this 
paper, we present a fast and simple generic approach for obtaining new 
lower bounds, based on dual feasible functions. Worst case analysis as 
well as computational results show that one of our classes clearly out- 
performs the currently best known “economical” lower bound for the bin 
packing problem by Martello and Toth, which can be understood as a 
special case. This indicates the usefulness of our results in a branch and 
bound framework. 



1 Introduction 

The bin packing problem (BPP) can be described as follows: Given a set of n 
“items” with integer size xi, . . . ,Xn, and a supply of identical “containers” of 
capacity C, decide how many containers are necessary to pack all the items. 
This task is one of the classical problems of combinatorial optimization and NP- 
hard in the strong sense - see Garey and Johnson [9]. An excellent survey by 
Goffmann, Garey, and Johnson can be found as Ghapter 2 in the recent book [2]. 

Over the years, many clever methods have been devised to deal with the re- 
sulting theoretical difficulties. Most notably, Fernandez de la Vega and Lueker [8], 
and Karmarkar and Karp [12] have developed polynomial time approximation 
schemes that allow it to approximate an optimal solution within 1 -|- £ in polyno- 
mial (even linear) time, for any fixed e. However, these methods can be hardly 
called practical, due to the enormous size of the constants. On the other hand, 
there is still a lack of results that allow it solve even moderately sized test prob- 
lems optimally - see Martello and Toth [15,16], and the ORLIB set of benchmark 
problems [1]. The need for better understanding is highlighted by a recent ob- 
servation by Gent [10]. He showed that even though some of these benchmark 

* This work was supported by the German Federal Ministry of Education, Science, 
Research and Technology (BMBF, Forderkennzeichen 01 IR 411 G7). 

R. E. Bixby, E. A. Boyd, and R. Z. Rfos-Mercado (Eds.): IPCO VI 
LNCS 1412, pp. 257-270, 1998. © Springer-Verlag Berlin Heidelberg 1998 
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problems of 120 and 250 items had defied all systematic algorithmic solution 
attempts, they are not exactly hard, since they can be solved by hand. 

This situation emphasizes the need for ideas that are oriented towards the 
exact solution of problem instances, rather than results that are mainly worst 
case oriented. In this paper, we present a simple and fast generic approach for 
obtaining lower bounds, based on dual feasible functions. Assuming that the 
items are sorted by size, our bounds can be computed in linear time with small 
constants, a property they share with the best lower bound for the BPP by 
Martello and Toth [15,16]. As it turns out, one of our classes of lower bounds 
can be interpreted as a systematic generalization of the bound by Martello and 
Toth. Worst case analysis as well as computational results indicate that our 
generalization provides a clear improvement in performance, indicating their 
usefulness in the context of a branch and bound framework. Moreover, we can 
show that the simplicity of our systematic approach is suited to simplify and 
improve other hand-tailored types of bounds. 

The rest of this paper is organized as follows. In Section 2, we give an intro- 
duction to dual feasible functions and show how they can be used to obtain fast 
and simple lower bounds for the BPP. In Section 3, we consider the worst case 
performance of one of our classes of lower bounds, as well as some computational 
results on the practical performance. Section 4 concludes with a discussion of 
possible extensions, including higher-dimensional packing problems. 



2 Dual Feasible Functions 

For the rest of this paper, we assume without loss of generality that the items 
have size Xi G [0,1], and the container size C is normalized to 1. Then we 
introduce the following: 

Definition 1 (Dual feasible functions). A function u : [0, 1] ^ [0, 1] is called 
dual feasible, if for any finite set S of nonnegative real numbers, we have the 
relation 



^a; < 1 '^u{x) < 1. (1) 

x^S x^S 

Dual feasible functions have been used in the performance analysis of heuris- 
tics for the BPP, first by Johnson [11], then by Lueker [13]; see Coffman and 
Lueker [3] for a more detailed description. The Term (which was first introduced 
by Lueker [13]) refers to the fact that for any dual feasible function u and for any 
bin packing instance with item sizes x\,. . . , Xn, the vector (u(xi), . . . , u{xn)) is a 
feasible solution for the dual of the corresponding fractional bin packing problem 
(see [12]). By definition, convex combination and compositions of dual feasible 
functions are dual feasible. 

We show in this paper that dual feasible functions can be used for improving 
lower bounds for the one-dimensional bin packing problem. This is based on the 
following easy lemma. 
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Lemma 1. Let I := {x\, . . . ,Xn) be a BPP instance and let u he a dual feasi- 
ble function. Then any lower hound for the transformed BPP instance u{I) := 
{u{x \), . . . , u{xn)) is also a lower bound for I. 

By using a set of dual feasible functions U and considering the maximum 
value over the transformed instances u{I), u G U, we can try to obtain even 
better lower bounds. 

In [13], a particular class of dual feasible functions is described; it relies on a 
special rounding technique. For a given k € IN, consider the stair function 
that maps (for i G {1, . . . , fc}) all values from the interval [j;^, onto the 
value p 1 is mapped to 1. We give a slightly improved version and a simple 
proof that these functions are dual feasible. 

Theorem 1. Let fc G IN. Then 

[ 0 , 1 ] ^[ 0 , 1 ] 

J X, for x(k + 1) G ^ 

^ I L(^ + e^se 

is a dual feasible function. 

Proof. Let S' be a finite set of nonnegative real numbers with ^ Xi < 1. We 

XiGS 

have to show that ^ u^^^Xi) < 1. Let T := {xi G S | Xi{k + 1) G Z}. Clearly, 

XiGS 

we only need to consider the case S ^ T. Then 
{k 1) u^'^^Xi) -k k ^ u^'^^Xi) = {k 1) Xt + ^ [(fc+l)xij 

XiGT XiGS\T Xi^T XiGS\T 

< (fc + 1) ^ Xi. 

Xi^S 

By definition of the terms (fc + 1) are 

integer, so by virtue of ^ < 1, we have the inequality (fc+ 1) (^) + 

XiGS 

^XxGS\T“*'^Ha^) S k, implying X <1. □ 

' xes 

The rounding mechanism is visualized in Figure 1, showing the difference of 
the stair functions fc G {!,..., 4}, and the identity id over the interval [0, 1]. 
For the BPP, we mostly try to increase the sizes by a dual feasible function, since 
this allows us to obtain a tighter bound. The hope is to find a for which 
as many items as possible are in the “win zones” - the subintervals of [0, 1] for 
which the difference is positive. 

The following class of dual feasible functions is the implicit basis for the 
bin packing bound L 2 by Martello and Toth [15,16]. This bound is obtained by 
neglecting all items smaller than a given value e. We account for these savings 
by increasing all items of size larger than 1 — e. Figure 2 shows the corresponding 
win and loss zones. 
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Fig. 1. Win and loss zones for 



Let e G [0, i 


|. Then 








[0,1]- 


[0,1] 










for 




X 1-^ 


{ a;, 


for 






lo. 


for 



X > 1 — e 
e < X <1 — € 
X < e 



is a dual feasible funetion. 



Proof. Let S' be a finite set of nonnegative real numbers, with X) ^ 

X^S 

consider two cases. If S contains an element larger than 1 — e, then all other 
elements have to be smaller than e. Hence we have ^ {x) =1. If all elements 

xGS 

of S have at most size 1 — e, we have ^ U^’^\x) < X] * — 1- 

x^S x^S 



Our third class of dual feasible functions has some similarities to some bounds 
that were hand-tailored for the two-dimensional and three-dimensional BPP by 




y = U® (X) 



Fig. 2. Win and loss zones for 
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Fig. 3. Win and loss zones for 



Martello and Vigo [17], and Martello, Pisinger, and Vigo [14]. However, our 
bounds are simpler and dominate theirs. 

This third class also ignores items of size below a threshold value e. For the 
interval (e, ^], these functions are constant, on (|, 1] they have the form of stair 
functions. Figure 3 shows that for small values of e, the area of loss zones for 
exceeds the area of win zones by a clear margin. This contrasts to the behavior 
of the functions and where the win and loss areas have the same size. 



Theorem 3. Let e G [0, ^). Then 






[ 0 , 1 ] 



X I— *■ 



[ 0 , 1 ] 

1 P-F 

lo. 



for X > ^ 
f’ur e < a; < i 
for X < e 



is a dual feasible function. 



Proof. Let S' be a finite set of nonnegative real numbers, with 
S' := {a; G S I e < a: < 1}. We distinguish two cases: 



^ a; < 1. Let 

x£S 



If all elements of S have size at most 1, then by definition of S', 



1 > 






x£S xGS' 



(2) 
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holds. Since |S"| is integral, it follows that |S"| < [e hence 

^ ^ = l^'l^ < 1. 

x£S xeS' ^ 

Otherwise S contains exactly one element y > ^ and we have 



1 > ^ X > y + ^ X >y + |S"|e. 

xGS xGS' 



( 3 ) 

( 4 ) 



Therefore |S"| < [(1 — y)e and hence 

^ ^ </.W(x) = 1 - 

xes xes' L J L J 



□ 



3 A Class of Lower Bounds for the BPP 

In this section, we show how dual feasible functions can be combined by virtue 
of Lemma 1 in order to get good bounds for the BPP. For this purpose, we 
consider the lower bound L 2 , suggested by Martello an Toth [15,16], which can 
be computed in time O(nlogn). According to the numerical experiments by 
Martello and Toth, L 2 yields very good approximations of the optimal value. 
We describe L 2 with the help of dual feasible functions. Using Lemma 1, it 
is straightforward to see that L 2 provides a lower bound for the BPP. Our 
description allows an easy generalization of this bound, which can be computed 
in linear time. We show that this generalized bound improves the asymptotic 
worst-case performance from | to |. Empirical studies provide evidence that 
also the practical performance is significantly improved. 



3.1 The Class 

* 

We give a definition of (asymptotic) worst case performance: 

Definition 2 ((Asymptotic) worst case performance). Let L be a lower 
hound for a minimization problem P. Let opt{L) denote the optimal value of P 
for an Lnstance L . Then 



r{L) := sup 



m 

opt{I) 



\I 



is an instance 




is called the worst case performance and 



(6) 



X 00 (.b) 



inf sup 



HI) 

opt{I) 



\L is an instance of P with opt{L) > s 



( 7 ) 



is called the asymptotic worst case performance of L. 
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The easiest lower bound for the BPP is the total volume of all items (the size 
of the container will be assumed to be 1), rounded up to the next integer. For a 
normalized BPP instance I := {xi, . . . , Xn), this bound can be formulated as 



Li{I) 



n 

^Xi . 

2=1 



( 8 ) 



Theorem 4 (Martello and Toth). r{Li) = 

Martello and Toth showed that for their ILP formulation of the BPP, the bound 
Li dominates the LP relaxation, the surrogate relaxation, and the Lagrange 
relaxation. (See [16], pp. 224.) According to their numerical experiments, Li 
approximates the optimal value very well, as long as there are sufficiently many 
items of small size, meaning that the remaining capacity of bins with big items 
can be exploited. If this is not the case, then the ratio between L\ and the 
optimal value can reach a worst case performance of r{Li) = |, as shown by the 
class of examples with 2k items of size 5 + 

In the situation which is critical for Li (“not enough small items to fill up 
bins”), we can make use of the dual feasible functions from Theorem 2; 
these functions neglect small items in favor of big ones. This yields the bound 
L 2 : For a BPP instance I, let 

L 2 (/) := max Li{U^"\l)). (9) 

£G[0.1] 

From Lemma 1, it follows immediately that L 2 is a lower bound for the BPP. 
The worst case performance is improved significantly: 

Theorem 5 (Martello and Toth). r{L 2 ) = |. 

This bound of | is tight: consider the class of BPP instances with 6k items of 
size i + 

There is little hope that a lower bound for the BPP that can be computed 
in polynomial time can reach an absolute worst case performance above | . Oth- 
erwise, the AfP-hard problem Partition (see [9], p. 47) of deciding whether two 
sets of items can be split into two sets of equal total size could be solved in 
polynomial time. 

L 2 can be computed in time O(nlogn). The computational effort is deter- 
mined by sorting the items by their size, the rest can be performed in linear time 
by using appropriate updates. 

Lemma 2 (Martello and Toth). Consider a BPP instance I := (xi, . . . , x„). 
If the sizes Xi are sorted, then L 2 {I) can he computed in time 0{n). 

Now we define for any k G IN 

Lf\l)-.= max Li («('=) o [/(<=)(/)) 

£e[o.i] 



( 10 ) 
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and for q >2 consider the following bounds: 

:= max{L 2 (I), max (11) 

k—2,...,q 

By Lemma 1, these are valid lower bounds. As for L 2 , the time needed for 
computing these bounds is dominated by sorting: 

Lemma 3. Let I := {x\, . . . , Xn) he an instance of the BPP. If the items Xi are 
given sorted by size, then can he computed in time 0(n) for any fixed 

q>2. 



3.2 An Optimality Result for 

(' 2 ') 

For the described class of worst case instances for L 2 , the bound Ll yields the 
optimal value. As a matter of fact, we have: 

Theorem 6 . Let I := {x \, . . . , Xn) he a BPP instance with all items larger than 
3 . Then L\ {!) equals the optimal value opt{I). 

Proof. Without loss of generality, let x\ > X 2 >■■■ > Xn- Consider an optimal 
solution, i. e., an assignment of all items to m := opt{I) bins. Obviously, any bin 
contains one or two items. In each bin, we call the larger item the lower item, 
while (in the bins with two items) the smaller item is called the upper item. In 
the following three steps, we transform the optimal solution into normal form. 

1. Sort the bins in decreasing order of the lower item. 

2. Move the upper items into the bins with highest indices. The capacity con- 
straint remains valid, since increasing bin index corresponds to decreasing 
size of the lower item. 

3. Using appropriate swaps, make sure that in bins with two items, increasing 
index corresponds to increasing size of the upper item. Again, the order of 
the bins guarantees that no capacity constraint is violated. 



Eventually, we get the normal form shown in Figure 4: Item Xi is placed in bin 
i. The remaining items m-|-l,TO-|-2, ...,n are placed in bins m,m — 1, . . . ,m — 
(n — (m -l- 1)). The first bin with two items has index q := 2m + 1 — n. 

For t G {1, . . . , n}, the assumption Xi > ^ implies 



«»>(*.) = 155 ^ > 1 



( 12 ) 



For n > 2m — 1, we get 



4")(/)>Li(u(2)(/)) 



i=l 



> 



(2m — 1) 
2 



= m. 



(13) 



Hence, the lower bound L 



(2) 

2 



(!) yields the optimal value, leaving the case 



n < 2m — 1. 



(14) 
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Fig. 4. Normal form of an optimal solution for a BPP instance with sizes Xi > 
1/3. 



Then q = 2m + 1 — n > 2m + 1 — (2m — 1) = 2, implying that at least bins 1 
and 2 contain only one item. 

If Xm > 5 , then Xi > ^ holds for all i < m, as the items are sorted by 
decreasing size, thus U^^^‘^\xi) = 1. Hence, 






2=1 



= m. 



(15) 



In this case, L 2 {I) equals the optimal value. Therefore, assume 

iCm < ^ (16) 

for the rest of the proof. 

If Xm-i + Xm > 1, let e := Xm < 1/2. For i < m — 1, we have Xi > Xm-i > 
1 — Xm = 1 — hence 

o U^'^^(xi) = = 1. (17) 

Furthermore, 1/3 < Xm <1/2 implies 

O [/(^^(Xm) = = ^- (18) 

All in all, we have 



> 



o u^'^^(xi) 



(m - 1 ) + 



(19) 



i. e., the bound meets the optimum. 
This leaves the case 



Xm—l “t” Xm < 1- 



( 20 ) 
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Fig. 5. Determining i* . 



Using the assumptions (14) and (20), we show that there is an i* G {2m — 
n, . . . , n} with 



Xi* + X2m—i*—l ^ 1 - (^^) 

Figure 5 shows the meaning of this statement: At least one of the upper items 
cannot be combined with the lower item two bins to the left. 

This is shown in the following way: If for all i* G {2m — n, . . . , n}, we had 

Xi X2m—i—l ^ 1 ; (^^) 



then all upper items could be moved two bins to the left, since the first two 
items do not contain more than one item by (14). This would allow it to pack 
item Xm-i with Xm, saving a bin. 

Now let e := X 2 m-i*-i- By (21), we have for all i G {1, . . . ,i*} that 



Xi > Xi* > 1 — e. 



hence 



O U^^\xi) = M^^^(l) = 1. 



For i G {i* + 1, . . . , 2m — 1 — i*} we have 



Xi ^ X2m—i* — l — ^5 



and therefore 

O U^'^\xi) > U^‘^\xi) > i. 

Summarizing, we get 



if a) > 



oU‘'^\xi)+ oU^^\xi) 

i—1 



(23) 

(24) 

(25) 

(26) 

(27) 
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> 



E 



i—l 



1 

2 



(2to - 2i* 




(28) 

(29) 



This completes the proof. 



□ 



3.3 Worst Case Performance of 

* 

As we have stated above, we cannot hope to improve the worst case performance 
of L 2 - However, we can show that the asymptotic worst case performance is 
improved by roo(T*^^): 

Theorem 7. 



r^{L^)=^. (30) 

Proof. Let / := {xi,. . . , x„) be a BPP instance. We start by showing 

max{L 2 (/),L^^^(/)} > ^opt(I) - 1. (31) 

By Theorem 6, all items with > | fit into m < L^\l) bins. Let these bins 
be indexed by l,...,m. Using the First Fit Decreasing heuristic, we add the 
remaining items, i.e., sort these items by decreasing order and put each item 
into the first bin with sufficient capacity; if necessary, use a new bin. Let q denote 
the number of bins in this solution. 

Suppose we need m bins for the big items and not more than y items for 
the rest, then Theorem 6 yields the first part of the statement: 

opt{I) <q< < ^max{L^^^(/),L 2 (/)}. (32) 

Therefore, assume 



4 



> m 



(33) 



for the rest of the proof. 

Let a denote the largest free capacity of one of the bins 1 through to, i.e., 
the total size of all items in this bin is at least (1 — a). 

No item that was placed in one of the bins to + 1 through q can fit into the 
bin with free capacity a. This means that all these bins can only contain items 
Xi > a. On the other hand, the size of these items does not exceed |. This 
implies that the bins to + 1 through q — 1 must contain at least three items of 
size Xi > a, while bin q holds at least one item Xi > a. 
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Thus, we get 

L2{I) > Ti(/) = 

Now consider two cases. 



i=l 



> |"(1 — a)m + 3a(g — 1 — m) + a] (34) 
= [(1 — 4a)m + ia{q — 1) + a] . 



Case 1: Assume a > Since (1 — 4a) < 0, we can replace the term m in 



(34) by ^ > m: 



L2{I) > (1 — 4a)^g + 3a(g — 1) + a 



3 

-q-2a 



> - 1 > ^opt{I) - 1. 



Case 2: For a < j, neglect the term (1 — 4a)m > 0 in (34): 
L 2 {I) > (3a(g — 1) + a] 

> ^9 - 1 > ‘^opt{I) - 1 . 



(35) 

(36) 

(37) 

(38) 

(39) 



This proves roo(T?^) > For showing that equality holds, consider the family of 
bin packing instances with 3/c items of size 1/4+ (5 with <5 > 0. This needs at least 
k bins. For sufficiently small S, we have = 0 and ^2(7) = Li(I) < jk+1. □ 



3.4 Computational Performance of 

Generally speaking, L 2 yields results that are orders of magnitude better than 
the worst case performance (see [15,16]). In the following, we compare the com- 
putational performance of +1'^^ on the same type of benchmark problems. As it 
turns out, we get a clear improvement. 

For our computational investigation, we generated random instances in the 
same way as described in [16], pp. 240. The bin size is normalized to 100. For a 
given number n of items, the sizes were generated randomly with even distribu- 
tion on the sets {1, . . . , 100}, {20, . . . , 100}, and {35, . . . , 100}. For each problem 
class, and n € {100,500,1000}, we generated 1000 instances. In the table, we 
compare L 2 with the bounds , and +1^°^ . Shown is the average relative 

error in percent (% err) and the number of instances, for which the optimal 
value was reached (^ opt). The optimum was computed with the routine MTP 
from [16] with a limit of 100000 search nodes. 

Especially for large instances, we see a clear improvement by the new bounds. 
Compared with L 2 , the number of times that the optimal value is met is increased 
by 50 % for n = 1000 and the first two problem classes. For the third problem 
class, we always get the optimum, as shown in Theorem 6. The differences be- 
tween and are significant, but small. 
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Table 1. Performance of lower bounds for the BPP. 









2 


L 


m 






Interval 


n 


% err 


# opt 


% err 


# opt 


% err # opt 


% err 


# opt 




100 


0.432 


774 


0.324 


832 


0.303 


843 


0.272 


859 


[1, 100] 


500 


0.252 


474 


0.157 


644 


0.154 


645 


0.130 


693 




1000 


0.185 


381 


0.116 


578 


0.114 


578 


0.100 


606 




100 


0.419 


732 


0.297 


812 


0.265 


832 


0.232 


853 


[20, 100] 


500 


0.231 


443 


0.144 


634 


0.138 


642 


0.123 


677 




1000 


0.181 


366 


0.104 


605 


0.103 


605 


0.091 


632 




100 


0.229 


827 


0.000 


1000 


0.000 


1000 


0.000 


1000 


[35, 100] 


500 


0.160 


553 


0.000 


1000 


0.000 


1000 


0.000 


1000 




1000 


0.114 


507 


0.000 


1000 


0.000 


1000 


0.000 


1000 



4 Conclusions 

We have presented a fast new method for generating lower bounds for the bin 
packing problem. The underlying method of dual feasible functions can also be 
used in the case of higher dimensions by combining our ideas the approach for 
modeling higher-dimensional orthogonal packings that we developed for find- 
ing exact solutions for the d-dimensional knapsack problem [4]. Details will be 
contained in the forthcoming papers [5,6,7]. 
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Abstract. We extend the theoretical foundations of the branch-and-cut 
method nsing lift-and-project cuts for a broader class of disjunctive con- 
straints, and also present a new, snbstantially improved disjunctive cut 
generator. Employed together with an efficient commercial MIP solver, 
our code is a robust, general pnrpose method for solving mixed integer 
programs. We present extensive computational experience with the most 
difficult problems in the MIPLIB library. 



1 Introduction 

Disjunctive programming is optimization over a finite union of convex sets. Its 
foundations were developed, and the term itself coined in the early seventies by 
Balas [4,5]; since then it attracted the attention of numerous researchers, includ- 
ing Jeroslow [18,19], Blair [12], Williams [25], Hooker [15], Beaumont [11], Sherali 
and Shetty [23], Meyer [21]. Besides having an elegant theory, disjunctive pro- 
gramming provides a way to formulate a wide variety of optimization problems, 
such as mixed integer programs, linear complementarity, job-shop scheduling, 
equilibrium problems, and so on. 

There is a natural connection between disjunctive programming problems 
and logic. In fact, a recent paper of Hooker and Osorio [16] proposes to solve 
discrete optimization problems that can be formulated by using logic and linear 
programming. They call this area of mathematical programming MLLP (Mixed 
Logical Linear Programming). Even though MLLP is, in principle, more general 

* Both authors were supported by NSF grant DMS 95-27-124 

R. E. Bixby, E. A. Boyd, and R. Z. Rfos-Mercado (Eds.): IPCO VI 
LNCS 1412, pp. 271-283, 1998. © Springer-Verlag Berlin Heidelberg 1998 
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than disjunctive programming, since it allows more flexibility in the representa- 
tion of logical formulas, every MLLP can be represented as a general disjunctive 
program. 

Recently, Balas, Ceria and Cornuejols [8,9] proposed the lift-and-project me- 
thod, which is related to the work of Balas on disjunctive programming, the 
matrix- cuts of Lovasz and Schrijver [20], the hierarchy of relaxations for mixed- 
integer programming of Sherali and Adams [22], and the intersection cuts of 
Balas [3]. The implementation of the lift-and-project method in a branch-and- 
cut framework (see [9]) proved to be very effective when tackling difficult mixed 
0-1 programs. 

The goal of our work is twofold. First, we show that the lift-and-project 
method for the 0-1 case can be extended quite naturally for a large class of 
disjunctions, called facial disjunctions. Such disjunctive constraints abound in 
practice, and our preliminary computational experience shows that - when the 
disjunctions are carefully chosen - the cuts generated from them outperform 0-1 
disjunctive cuts. 

Second, we attempt to answer the challenge posed by a new generation of 
commercial MIP solvers. We implemented a new, much improved lift-and-project 
cut generator, and we present our computational experience with it. Combining 
our separator with an efficient commercial code (CPLEX 5.0) in a cut-and- 
branch framework yields an extremely robust general purpose MIP solver. 

In the rest of this section we provide the basic notation and definitions. 
In Section 2 we give a brief overview of known properties of facial disjunctive 
programs, and describe how disjunctive cuts generated from facial disjunctions 
can be lifted. In Section 3 we discuss implementation issues and present new 
computational results on the problems of MIPLIB 3.0 with a substantially im- 
proved version of a lift-and-project cut generator. Finally, in Section 4 we give 
our conclusions, discuss ongoing work, and future directions. 

Disjunctive Sets and Disjunctive Cuts. A disjunctive set is a set of points 
satisfying a collection of inequalities connected by the logical connectors A (con- 
junction, “AND” ) and V (disjunction, “OR”). 

A disjunctive set is in Disjunctive Normal Form or DNF, if its terms do not 
contain further disjunctions. For simplicity, we shall be dealing with sets in DNF 
that contain only two terms, i.e. sets of the form 

Ko U iFi (1) 

Moreover, we shall assume that there is a polyhedron K that contains both Kq 
and K\, and Kq and Ki are defined by one additional inequality, that is 

K = {x\Ax>b} , . 

Kj = {x j Ax > b, d^x > gj} {j = 0, 1) ' 

We shall denote 



P = clconv {Kq U Ki) 



( 3 ) 
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A disjunctive set is in conjunctive normal form, or CNF if its conjunctions do 
not contain further conjunctions. E.g. if we are given the set K as above and 

Kij = {x\Ax > b, (F^x > gij} {i = I, . . . ,p, j = Q,l) 

then the set 

{x\ f\ (a; G ATjo V a: G ATji) } (4) 

2=l,...,p 

or, equivalently 

{ a; I Aa; > 5, j\ {F°x > g^o V ct^x > gn) } (5) 

2=1,. ..,p 

is in conjunctive normal form. 

As an example, consider the feasible set of a mixed 0-1 program. In this case 
K is the feasible set of the linear programming relaxation, and 

Kio = { a: I Ax > b, Xi < 0} 

Nil = {x \ Ax > b, Xi > 1} 

for (i = 1, . . . ,p). Then with this definition of Kio and Kn (5) is the usual way 
of expressing the feasible set of the mixed 0-1 program. Moreover, if for some i 
we choose 

Ko = Km, Ki = Kii (6) 

then the set in DNF in the strengthening of the LP-relaxation obtained by 
imposing the 0-1 condition on the variable Xi. 

A disjunctive program (DP for short) is an optimization problem with the 
feasible set being is a disjunctive set. Optimizing a linear function over the set 
in (1) can be done by optimizing over P. 

If 77 is a polyhedron, then we denote 

77* = {{a,(}) I ax > /7 is a valid inequality for 77} 

The next theorem, due to Balas, provides a representation of the set P in (1). 
This representation will be used in the next section as the basis for disjunctive 
cutting plane generation. 

Theorem 1 (Balas [4]). Assume that the sets Kq and Ki are nonempty. Then 
(a,/3) G P* if and only if there exists u^,v^,u^,v^ such that 

u^A +v^d^ = ot {j = 0, 1) 
u^b +vf>gj > (3 {j = 0, 1) 
uCv^ > 0 (j = 0, 1) 



( 7 ) 
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In fact, this result holds under a more general regularity condition, than the 
nonemptyness of all KiS (assuming nonemptyness, the result follows by simply 
using Farkas’ lemma). 

The lift-and-project method is based on the generation of disjunctive or lift- 
and-project cuts, ax > j3 which are valid for P, and violated by the current 
LP-solution X, i.e. ax < j3. Theorem 7 provides a way of generating disjunctive 
cuts for a general disjunctive program through the solution of a linear program, 
called the cut- generation LP, of the form 

max (3 — ax 

s.t. (a,/3) G P* (8) 

{a,P)GS 

where S' is a normalization set ensuring boundedness of the CLP. 

There are several possible choices for the set S (see [13] for a complete de- 
scription). In our current implementation we use the following normalization 
constraint: 

1 

S = {(m°, v^, u^,v^) : '^{u^ + v^)'^e < 1} 
j=o 

where e is a vector of all ones of appropriate dimension. 

2 Facial Disjunctions 

Our purpose is to use disjunctive cuts for solving general disjunctive programs. 
We will devote particular attention to a special class of facial disjunctive pro- 
grams. A disjunctive set is called facial if the sets Kg and Ki are faces of K. 
Some examples of facial and non-facial disjunctive programs include: 

• Variable upper bound constraints: Suppose that we wish to model the 
following situation: if an arc t in a network is installed, we can send up to Ui 
units of flow on it; if it is not installed, we can send none. If we denote by yi 
the amount of flow on the arc, and Xi is a 0-1 variable indicating whether 
the arc is installed, or not, then clearly, 

x^ = I y = 0 , . 

0<y,<u 

is a correct model. 

• Linear complementarity: The linear complementarity problem (LCP for 
short) is finding x, z satisfying 

x,z > 0 

Mx -\- z = q (10) 

= 0 

Clearly, (10) is a facial disjunctive program in CNF, with the disjunctions 
being Xi = 0 V Zj = 0. 
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• Ryan-Foster disjunctions: This disjunction is used as a very successful 
branching rule in solving set-partitioning problems. Precisely, suppose that 
the feasible set of an SPP is given as 

Xk G {0, IjVfc 
Ax = e 

with A being a matrix of zeros and ones, and e the vector of all ones. Denote 
by Ri the support of the row of A. Then the disjunction 

y^a:fc = lvy^a:fc = 0 

keR keR 

is valid for all the feasible solutions of the SPP, if i? is a subset of any Ri. 
However, if R is chosen as the intersection of Ri^ and Ri^ for two rows i\ 
and Z 2 , then the disjunction will perform particularly well as a branching 
rule, when solving the SPP by branch-and-bound. 

• Machine scheduling (see [6]): In the machine scheduling problem we 
are given a number of operations {1, . . . , n} with processing times pi, ... ,pn 
that need to be performed on different items using a set of machines. The 
objective is to minimize total completion time while satisfying precedence 
constraints between operations and the condition that a machine can process 
one operation at a time, and operations cannot be interrupted. The problem 
can be formulated as 

Min tn 

tj ti ^ di, {i, j) G z nil 

U>0, i = {l,...,n} 
tj — ti>di\/ti — tj > dj (i,j) G W 

where ti is the starting time of job i, Z is the set of pairs of operations 
constrained by precedence relations and W is the set of pairs that use the 
same machine and therefore cannot overlap. 

This disjunctive program is NOT facial. 

One of the most important theoretical properties of 0-1 disjunctive cuts is 
the ease with which they can be lifted ([8,9]). For clarity, recall that 

K = {x\ Ax > b} 

Kj = {x \ X G K, dM > pj} {j = 0, 1) 

P = clconv {Kq U Ki) 

Let K' be a face of K, and 

KI = {x\xG K', dM > gj} (j = 0, 1) 

P' = clconv (K'gUK[) 

Suppose that we are given a disjunctive cut {a' ,13') valid for P' and violated by 
X G K' .Is it possible to quickly compute a cut {a, (3), which is valid for P, and 
violated by a: ? 



(12) 



(13) 
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The answer is yes, if {a' ,(3') is a cut obtained from a 0-1 disjunction ([8,9]), 
and K' is obtained from K by setting several variables to their bounds. Moreover, 
it is not only sufficient to solve the CLP with the constraints representing K' 
in place of K, its size can be reduced by removing those columns from A which 
correspond to the variables at bounds. In practice these variables are the ones 
fixed in branch-and-cut, plus the ones that happen to be at their bounds in the 
optimal LP-solution at the current node. Cut lifting is vital for the viability of 
the lift-and-project method within branch-and-cut; if the cut generation LP’s 
are solved with putting all columns of A into the CLP, the time spent on cut 
generation is an order of magnitude larger, while the cuts obtained are rarely 
better ([9]). The main result of this section is : 

Theorem 2. Let K' he a an arbitrary face of K and Kj and P' as above. Let 
{a' ,13') € {P')* with the corresponding multipliers given. Then we can compute 
a cut (a, /?) that is valid for P and for all x € K' 

ax — (3 = a' X — (3' (14) 

Proof. Somewhat surprisingly, our proof is even simpler, than the original for 
the 0-1 case. Let K' be represented as 

K' = {x \ A^x = A+x > 6+ } 

where the systems A^x > 6= and A^x > 6+ form a partition of Ax > b. Since 
(a',P') G {P')*, we have 

a = ufA^ + ufAj^ + VqS 

= u^^A^ + u\A_^. + Vit , s 

(3 < u°J)= + 6+ -I- VqSq 

(3 < + u\b^ + vAo 

with uf > 0, > 0,Uo > 0,Ui > 0, and uf and unconstrained. Then 

if we replace the negative components of uf and by 0, and compute the 
corresponding (a,/3) it will clearly be valid for P and satisfy (14). □ 

It is interesting to note that the above proof also implies that if the CLP 
is solved with a normalization that is imposed only on (3, such as /? = ±1, or 
\(3\ < 1, then the lifted cut will also be optimal. Hence the above theorem also 
generalizes the cut-lifting theorem in [8]. 

Suppose that after possibly complementing variables, multiplying rows by a 
scalar, adding rows, and permuting columns A^ and 6= can be brought into the 
form 

= [/, 0 ] , 6 = = 0 

This condition is satisfied for most facial disjunctions of importance. Then just 
as in the 0-1 case, we can remove the columns from A that correspond to the 
columns of /, and solve the cut generation LP in the smaller space. 

The consequence of these results is that facial disjunctions can be used for 
branching in a branch-and-cut algorithm, in place of the usual 0-1 branching. At 
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any given node of the tree, the LP-relaxation is always a system that arises from 
the system defining K by imposing equality in some valid inequalities. In other 
words, the LP-relaxation at any given node defines a face of the LP-relaxation at 
the root. Therefore, if we also wish to generate disjunctice cuts, this can always 
be done using the LP-relaxation at the current node of the branch-and-cut tree, 
then lifting the resulting cut to be globally valid. Notice, that for this scheme to 
work, we only require the disjunctions for branching to be facial; the disjunctions 
for cutting can be arbitrary. 

3 Computations 

3.1 The Implementation 

The computational issues that need to be addressed when generating lift-and- 
project cuts were thoroughly studied in [9] . Their experience can be briefly sum- 
marized as: 

• It is better to generate cuts 

• in large rounds, before adding them to the linear programming relax- 
ation, and reoptimizing. 

• in the space of the variables which are strictly between their upper and 
lower bounds, then to lift the cut to the full space. 

• The distance of the current fractional point from the cut hyperplane is a 
reliable measure of cut quality. 

We adopted most of their choices, and based on our own experience, we added 
several new features to our code, namely, 

• In every round of cutting, we choose 50, (or less, if fewer are available) 0-1 
disjunctions for cut generation. In an analogous way, we also choose a set of 
general integer disjunctions of the form Xi < \xi\ \J Xi> \xi\. 

• The 50 0-1 disjunctions are chosen from a candidate set of 150. It is rather 
conceivable that a disjunction will give rise to a strong cut if and only if 
it would perform well when used for branching; i.e. the improvement of 
the objective function on the two branches would be substantial. Therefore, 
we use “branching” information to choose the variables from the candidate 
set. In the current implementation we used the strongbranch routine of 
CPLEX 5.0, which returns an estimate of the improvements on both gener- 
ated branches. Our strategy is testing the 150 candidate variables (or fewer 
if less are available) then picking those 50 which maximize some function 
(currently we use the harmonic mean) of the two estimates. We call this 
procedure the strong choice of cutting variables. We then repeat this process 
for the general integer variables, if any. We are in the process of testing other 
rules commonly used for selecting branching variables, like pseudo-costs and 
integer estimates. 

• We pay particular attention to the accuracy of the cuts. If the cuts we derive 
are numerically unstable we resolve the CLP with a better accuracy. 
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• We use a normalization constraint (the set S in CLP) that bounds the sum 
of all multipliers (ubu*). 

• In joint work with Avella and Rossi [1], we have chosen to generate more 
than one cut from one disjunction using a simple heuristic. After solving 
the CLP, we fix a nonzero multiplier to zero, then resolve. We repeat this 
procedure several times always checking whether the consecutive cuts are 
close to being parallel; if so, one of them is discarded. 



3.2 The Test-Bed and the Comparison 

As a benchmark for comparison, we used the commercial MIP solver CPLEX 
5.0, with the default parameters. 

As the testbed, we used problems from MIPLIB 3.0, a collection of publicly 
available mixed integer programming problems. We excluded those problems 
which were too easy for CPLEX, namely the ones that could be solved within 
100 nodes, and also the fast0507 problem, since it is too large. We divided the 
remaining problems into two groups. 

• “Hard” problems; the ones that cannot be solved within one thousand sec- 
onds by CPLEX with the default setting. 

• “Medium” problems. All the rest. 

Finally, since the enumeration code of MIPO was written 4 years ago, currently 
it is not competitive with the best commercial solvers. Therefore, we tested our 
cut-generator in a cut-and-branch framework. We generated 2 and 5 rounds of 
50-100 cuts, after every round adding them to the LP formulation, reoptimizing, 
and dropping inactive constraints. After the fifth round we fed the strengthened 
formulation to the CPLEX 5.0 MIP solver with the above setting. Also, all the 
cut-generation LP’s were solved using the CPLEX dual simplex code. 

All of our tests were performed on a Sun Enterprise 4000 with 8-167MHz 
CPU; we set a memory limit of 200 MB, and ran all our tests using one processor 
only. 

3.3 The Computational Results 

The results for those “hard” problems which could be solved with, or without 
cuts, are summarized in Table 2. Their description is included in Table 1. The 
problems not solved by any of the two methods are: danoint, danoSmip, noswot, 
setlch, Seymour. Nevertheless, on the last two problems lift-and-project cuts 
perform quite well; setlch can be solved with 10 rounds, and on seymour we 
were able to get the best bound known to date (see the next section). 

Also, the medium problems were run, and solved by CPLEX and cut-and- 
branch as well. The comparisons for these problems are not presented here, but 
cut-and-branch was roughly twice as fast if we aggregate all the results. 
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Table 1. Problem description. 



Problem 


LP value 


IP value 


lOteams 


897.00 


904.00 


air04 


55,264.43 


55,866.00 


air05 


25,877.60 


26,374.00 


arkiOOl 


7,009,391.43 


7,010,963.84 


gesa2 


25,476,489.68 


25,781,982.72 


gesa2_o 


25,476,489.68 


25,781,982.72 


harp2 


-74,325,169.35 


-73,893,948.00 


misc07 


1415.00 


2,810.00 


modOll 


-62,121,982.55 


-54,558,535.01 


modglob 


20,430,947.62 


20,740,51 


p6000 


-2,350,838.33 


-2,349,787.00 


pkl 


0.00 


11.00 


pp08a 


2748.35 


7350.00 


ppOSaCUTS 


5480.61 


7350.00 


qiu 


-931.64 


-132.87 


rout 


-1393.38 


-1297.69 


setlch 


30426.61 


49,846.25 


vpm2 


9.89 


13.75 



The following preliminary conclusions can be drawn. 

(1) In 9 problems out of the 22, either 2, or 5 rounds (in most cases 5) of lift- 
and-project cuts substantially improve the solution time. In 5 problems the 
difference is “make-or-break” ; between solving, or not solving a problem, or 
improving the solution time by orders of magnitude. 

(2) On 6 problems, our cuts do not make much difference in the solution time. 

(3) On 3 problems, adding our cuts is actually detrimental. It is important to 
note, that the deterioration in the computing time is not due to the time 
spent on generating the cuts, rather to the fact, that they make the linear 
programming relaxation harder to solve. In fact, it is most likely possible to 
catch this effect by monitoring, e.g. the density of the cuts, and the deteri- 
oration of the LP relaxation’s condition number. 

Lift-and-Project Cuts on Two Very Difficult Problems. There were 5 
problems that neither CPLEX alone, nor our code (with at most 5 rounds of cuts) 
was able to solve. These are : danoint, danoSmip, noswot, setlch and seymour. 
All of them are notoriously hard, and currently unsolvable by general purpose 
MIP-solvers within a reasonable time. In fact, danoSmip, noswot, and seymour 
have never been solved to optimality (although an optimal value for noswot is 
reported in MIPLIB 3.0, we could not find anyone to confirm the existence of 
such a solution). 

Our cuts do not perform well on the first 3 problems; danoint and danoSmip 
are network design problems with a combinatorial structure, already containing 
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Table 2. Computational results for cut-and-branch. 



Problem 


CPLEX 5.0 


Cut— and— Branch 2 


Cut— and— Branchs j 




Time 


Nodes 


Time 


Nodes 


Time 


Nodes 


lOteams 


5404 


2265 


1274 


306 


5747 


1034 


air04 


2401 


146 


1536 


110 


5084 


120 


air05 


1728 


326 


1411 


141 


4099 


213 


arkiOOl 


6994 


21814 


18440 


68476 


13642 


12536 


gesa2 


9919 


86522 


3407 


22601 


1721 


6464 


gesa2_o 


12495 


111264 


4123 


28241 


668 


4739 


harp2 


14804 


57350 


10686 


28477 


13377 


31342 


misc07 


2950 


15378 


2910 


12910 


4133 


14880 


modOll 


22344 


18935 


63481 


24090 


-f-f-1 


++-f 


modgfob 


-f-l+ 


+-f-f 


10033 


267015 


435 


5623 


p6000 


1115 


2911 


1213 


2896 


805 


1254 


pkl 


3903 


130413 


5094 


122728 


6960 


150243 


pp08a 


-f-1-1 


+-f-f 


1924 


47275 


178 


1470 


pp08aCUTS 


50791 


1517658 


277 


3801 


134 


607 


qiu 


35290 


27458 


15389 


10280 


27691 


15239 


rout 


19467 


133075 


26542 


155478 


40902 


190531 


vpm2 


8138 


481972 


1911 


63282 


974 


18267 



many special purpose cuts, and noswot is highly symmetric. However, disjunctive 
cuts perform strikingly well on the last two instances. 

Until now, setlch could be solved to optimality only by using special pur- 
pose path-inequalities [26]. After exhausting the memory limits, CPLEX could 
only find a solution within 15.86 % of the optimum. We ran our cutting plane 
generator for 10 rounds, raising the lower bound to within 1.4 % of the integer 
optimum. CPLEX was then able to solve the strengthened formulation in 28 
seconds by enumerating 474 nodes. It is important to note that no numerical 
difficulties were encountered during the cutting phase, (even if we generated 15 
rounds, although this proved unnecessary) and the found optimal solution pre- 
cisely agrees with the one reported in MIPLIB (the objective coefficients are 
one- fourth integral). 

The problem seymour is an extremely difficult setcovering instance; it was 
donated to MIPLIB by Paul Seymour, and its purpose is to find a minimal 
“irreducible configuration” in the proof of the four-colour conjecture. It has not 
been solved to optimality. The value of the LP-relaxation is 403.84, and an integer 
solution of 423.0 is known. The best previously known lower bound of 412.76 
[2] was obtained by running CPLEX 4.0 on an HP SPP2000 with 16 processors, 
each processor having 180 MHz frequency, and 720 Mflops peak performance, for 
the total of approximately 58 hours and using approx. 1360 Mbytes of memory. 

Due to the difficulty of the problem, we ran our cutting plane algorithm on 
this problem with a rather generous setting. We generated 10 rounds of cuts, in 
each round choosing the 50 cutting variables picked by our strong choice from 
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among all fractional variables with the iteration limit set to 1000. The total time 
spent on generating the 10 rounds was approximately 10.5 hours, and the lower 
bound was raised to 413.16. The memory useage was below 50Mbytes. Running 
CPLEX 4.0 on the strengthened formulation for approximately 10 more hours 
raised the lower bound to 414.20 - a bound that currently seems unattainable 
without using our cuts. 



Computational Results with Other Disjunctions. We ran our cut-genera- 
tor on two of the most difficult set-partitioning problems in MIPLIB, namely 
air 04 and air 05, by using the Ryan-Foster disjunctions described in the previous 
section. The results with two rounds of cuts are summarized in Table 3. 



Table 3. Results with the Ryan-Foster disjunctions. 



Problem 


CPLEX 5.0| 
Time 


CPLEX 5.0 
Nodes 


C&B 

Time 


C&B 

Nodes 


air04 


2401 


146 


1300 


115 


air05 


1728 


326 


1150 


123 



4 Conclusions and Future Directions 

In the near future we plan to explore the following topics: 



• Making our computational results with cut-and-branch more consistent. The 
key here is, finding the right amount of cutting, that sufficiently strengthens 
the LP-relaxation, but does not make it too difficult to solve. 

• We are currently implementing a branch-and-cut method that uses disjunc- 
tive cuts and allows branching on facial disjunctions, using cut lifting based 
on Theorem 2. We will use the disjunctions which are given as part of the 
formulation, and in some other cases, we will use the structure of the problem 
to generate other valid disjunctions. Our goal is to treat disjunctions in a way 
similarly to inequalities (that is, to maintain a set of “active” disjunctions, 
and to keep the rest in a “pool”), and handle them efficiently throughout 
the code. 

• We are in the process of testing our cutting plane generator with other 
commercial LP and MIP solvers (XPRESS-MP). This program allows for 
the generation of cutting planes within the enumeration tree without the 
need of programming our own enumeration, and hence improving on the 
efficiency. 
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Abstract. In this paper, we consider a class of 0-1 programs which, 
although innocent looking, is a challenge for existing solution methods. 
Solving even small instances from this class is extremely difficult for 
conventional branch-and-bound or branch-and-cut algorithms. We also 
experimented with basis reduction algorithms and with dynamic pro- 
gramming without much success. The paper then examines the perfor- 
mance of two other methods: a group relaxation for 0,1 programs, and 
a sorting-based procedure following an idea of Wolsey. Although the re- 
sults with these two methods are somewhat better than with the other 
four when it comes to checking feasibility, we offer this class of small 0,1 
programs as a challenge to the research community. As of yet, instances 
from this class with as few as seven constraints and sixty 0-1 variables 
are unsolved. 



1 Introduction 

Goal programming [2] is a useful model when a decision maker wants to come “as 
close as possible” to satisfying a number of incompatible goals. It is frequently 
cited in introductory textbooks in management science and operations research. 
This model usually assumes that the variables are continuous but, of course, it 
can also arise when the decision variables must be 0,1 valued. As an example, 
consider the following market-sharing problem proposed by Williams [18]: A 
large company has two divisions D\ and D 2 - The company supplies retailers 
with several products. The goal is to allocate each retailer to either division D\ 
or division D 2 so that D\ controls 40% of the company’s market for each product 
and D 2 the remaining 60% or, if such a perfect 40/60 split is not possible for 
all the products, to minimize the sum of percentage deviations from the 40/60 
split. This problem can be modeled as the following integer program (IP): 

* This work was supported in part by NSF grant DMI-9424348. 

** Part of the work was done while this author was affiliated with Carnegie Mellon 
University. 

R. E. Bixby, E. A. Boyd, and R. Z. Rios-Mercado (Eds.): IPCO VI 
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Min Er=ik*l 
s-t- + Si = 

Xj €{0,1} for j = 

Si free for i = 

where n is the number of retailers, m is the number of products, atj is the demand 
of retailer i for product j, and the right hand side vector bt is determined from 
the desired market split among the two divisions Di and D 2 - Note that the 
objective function of IP is not linear but it is straightforward to linearize it. 
This integer program also models the following basic Feasibility Problem (FP) 
in geometry: 

Feasibility Problem: Given m hyperplanes in 5ft”, does there exist a point 
X € {0,1}" which lies on the intersection of these m hyperplanes? 

If the optimum solution to IP is 0, the answer to FP is “yes”, else the answer 
to FP is “no”. Clearly, FP is NP-complete since for m = 1, FP is the subset- 
sum problem which is known to be NP-complete [6] . Problems of this form can 
be very difficult for existing IP solvers even for a relatively small number n of 
retailers and number m of products (e.g. n = 50, m = 6, and uniform integer 
demand between 0 and 99 for each product and retailer). More generally, with 
this choice of a^- , asking for a 50/50 split and setting n = 10(m — 1) produces a 
class of hard instances of 0-1 programs for existing IP solvers. 

In this paper, we consider instances from the above class generated as follows: 
Qij uniform integer between 0 and 99 {= D — 1), n = 10(m — 1) and bi = 
Ih J^j=i >^ore generally, in the range [^{-D + «b)J + 

Z)J=1 ^ij)\ + D - 1. 

2 Available Approaches for Solving IP 

In this section, we report on our computational experience with four different 
IP solvers available in the literature. 



2.1 Branch and Bound 

We found that even small instances of IP are extremely hard to solve using the 
conventional branch-and-bound approach. We offer the following explanation. 
For instances chosen as described above, there is often no 0-1 point in the inter- 
section of the hyperplanes, that is the optimum solution to IP is strictly greater 
than 0, whereas the solution to the linear programming relaxation is 0, even 
after fixing numerous variables to 0 or to 1. Because the lower bound stays at 
0, nodes of the branch-and-bound tree are not pruned by the lower bound until 
very deep into the enumeration tree. We illustrate this point in Table 1. We 
generated 5 instances of IP (each having 30 variables and 4 constraints) using 
the setup described above. We indicate the number of nodes enumerated to solve 
IP using CPLEX 4.0.3. For each of the 5 instances, the number of nodes enu- 
merated by the branch-and-bound tree is greater than Note that, in each 
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Table 1. Size of the branch-and-bound tree (5 instances). 



Problem 

size 


Number of nodes 
enumerated 


Optimal 

solution 


4 X 30 


1224450 


1.00 


4 X 30 


1364680 


2.00 


4 X 30 


2223845 


3.00 


4 X 30 


1922263 


1.00 


4 X 30 


2415059 


2.00 



case, the solution to IP is strictly greater than 0. Instances with 40-50 variables 
take several weeks before running to completion. 

Note: The problem class IP is related to the knapsack problems considered by 
Chvatal [3]. It is shown in [3] that these knapsack problems are hard to solve 
using branch-and-bound algorithms. However, the coefficients of the knapsack 
constraint are required to be very large (17[1, 10 s']). For the instances in the 
class IP that we consider, the coefficients Uij are relatively small (17[0,99]). By 
combining the constraints of IP with appropriate multipliers (e.g. multiplying 
constraint i by (jiDY~^) and obtaining a surrogate constraint, we get an equiva- 
lent problem by choosing D large enough, say D = 100 in our case. The resulting 
class of knapsack instances is similar to that considered in [3] . 



2.2 Branch and Cut 

The idea of branch-and-cut is to enhance the basic branch-and-bound approach 
by adding cuts in an attempt to improve the bounds. Here, we used MIPO, 
a branch-and-cut algorithm which uses lift-and-project cuts [1]. The computa- 
tional results were even worse than with the basic branch-and-bound approach 
(see Table 2). This is not surprising since, in this case, the linear programming 
relaxation has a huge number of basic solutions with value 0. Each cutting plane 
cuts off the current basic solution but tends to leave many others with value 
0. As we add more cuts, the linear programs become harder to solve and, over- 
all, time is wasted in computing bounds that remain at value 0 in much of the 
enumeration tree. 



2.3 Dynamic Programming 

Using the surrogate constraint approach described above, we get a 0-1 knap- 
sack problem which is equivalent to the original problem. Clearly, this technique 
is suitable for problems with only a few constraints. Dynamic programming 
algorithms can be used to solve this knapsack problem. Here, we use an imple- 
mentation due to Martello and Toth [13] pages 108-109. The complexity of the 
algorithm is 0(min(2”+^, nc)) where c is the right-hand-side of the knapsack 
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constraint. For the instances of size 3 x 20 and 4 x 30, we used the multiplier 
{nDy~^ with D = 100 for constraint i to obtain the surrogate constraint. For 
the larger instances, we faced space problems: We tried using smaller multipliers 
(e.g. {nD with D = 20) but then the one-to-one correspondence between 
solutions of the original problem and that of the surrogate constraint is lost. So 
one has to investigate allthe solutions to the surrogate constraint. Unfortunately, 
this requires that we store all the states of the dynamic program (the worst case 
bound for the number of states is 2”). Hence, even when we decreased the multi- 
pliers, we faced space problems. See Table 2. We note that there are other more 
sophisticated dynamic programming based procedures [14] which could be used 
here. Hybrid techniques which use, for example, dynamic programming within 
branch-and-bound are also available [13]. 



2.4 Basis Reduction 



For the basis reduction approach, we report results obtained using an imple- 
mentation of the generalized basis reduction by Xin Wang [16] [17]. It uses the 
ideas from Lenstra [11], Lovasz and Scarf [12] and Cook, Rutherford, Scarf and 
Shallcross [4]. We consider the feasibility question FP here. Given the polytope 
P = {0 < X < 1 : Ax = b}, the basis reduction algorithm either finds a 0,1 
point in P or generates a direction d in which P is “flat” . That is, maxjdx — dy : 
x,y G P} is small. Without loss of generality, assume d has integer coordinates. 
For each integer t such that [min{(ix : x G P}] < t < [maxjdx : x G P}J the 
feasibility question is recursively asked for P D {x : dx = t}. The dimension of 
each of the polytopes P C\ {x : dx = t} is less than the dimension of P. Thus, 
applying the procedure to each polytope, a search tree is built which is at most 
n deep. A direction d in which the polytope is “flat” is found using a generalized 
basis reduction procedure [12] [4]. 



2.5 Computational Experience 



Table 2 contains our computational experience with the different approaches 
given in Section 2. We choose 4 settings for the problem size: to x n = 3 x 20, 
4 X 30, 5 X 40 and 6 x 50. For each of these settings, we generated 5 instances 
as follows: random integers chosen uniformly in the range [0,99] and bi = 

aij)/2\. For branch-and-bound, we use the CPLEX 4.0.3 optimizer. For 
branch-and-cut, we use MIPO [1]. For dynamic programming, we use DPS [13]. 
For basis reduction, we use Wang’s implementation [16]. Times reported refer to 
seconds on an HP720 Apollo desktop workstation with 64 megabytes of memory. 
None of the problems with 40 or 50 variables could be solved within a time limit 
of 15 hours. 
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Table 2. Computing times for various solution procedures. 



Problem 


B & B 


B & C 


DP 


Basis Red. 


size (m x n) 


CPLEX 4.0.3 


MIPO 


DPS 


Wang 


3 X 20 


11.76 


213.62 


0.93 


300.42 


3 X 20 


13.04 


125.47 


1.28 


209.79 


3 X 20 


13.16 


208.76 


0.94 


212.03 


3 X 20 


13.64 


154.71 


1.31 


277.41 


3 X 20 


11.21 


190.81 


1.11 


197.37 


4 X 30 


1542.76 


*** 


20.32 


*** 


4 X 30 


1706.84 


*** 


20.37 


*** 


4 X 30 


2722.52 


*** 


20.31 


*** 


4 X 30 


2408.84 


*** 


18.43 


*** 


4 X 30 


2977.28 


*** 


18.94 


*** 


5 X 40 


*** 


*** 


+++ 


*** 


5 X 40 


*** 


*** 


+++ 


*** 


5 X 40 


*** 


*** 


+++ 


*** 


5 X 40 


*** 


*** 


+++ 


*** 


5 X 40 


*** 


*** 


+++ 


*** 


6 X 50 


*** 




+++ 


*** 


6 X 50 


*** 


*** 


+++ 


*** 


6 X 50 


*** 


*** 


+++ 


*** 


6 X 50 


*** 


*** 


+++ 


*** 


6 X 50 


*** 


*** 


+++ 


*** 



** Time limit (54000 seconds) exceeded. 
+++ Space limit exceeded. 



3 Two Other Approaches 

In this section, we introduce two other approaches to FP. 

3.1 The Group Relaxation 

The feasible set of FP is 



S = {x G {0, 1}” : Ax = b} 

where (T, b) is an integral m x (n+ 1) matrix. We relax S to the following group 
problem (CP). 



= {a; G {0, 1}" : Ax = b (mod <5)} 

where <5 G Z™. GP is interesting because (i) In general, GP is easier to solve 
than IP [15] and (ii) Every solution of FP satisfies GP. The feasible solutions to 
GP can be represented as s-t paths in a directed acyclic layered network G. The 
digraph G has a layer corresponding to each variable xj , j G IV, a source node s 
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and a sink node t. The layer corresponding to variable Xj has i5i x <52.-- x 6m nodes. 
Node where k = {ki, km), fci = 0, . . . , i5i — 1 for i = 1, . ■ ■ ,m, can be reached 
from the source node s if variables a;i, a;2, xj can be assigned values 0 or 1 such 
that = ki (mod Si), i = 1,2, ..., m. When this is the case, node has 

two outgoing arcs (j^, (j + 1)^) and (j^, (j + 1)^ ), where fc' = fci + aij+i(mod Si), 
corresponding to setting variable Xj+i to 0 or to 1. The only arc to t is from 

node (mod S) ^ digraph G has N = 2 + n y. S\... y. Sm nodes and at 

most twice as many arcs. 

Example 2: Consider the set 

S' = {x G {0, 1}^ : 3xi + 2x2 + 3x3 + 4x4 = 5 

6xi + 7x2 + 3x3 + 3x4 = 10} 

Corresponding to the choice Si = 2, i = 1,2, we have the following group 
relaxation 

S 22 = {x G {0, 1}^ : Ixi + 0x2 + 1x3 + 0x4 = 1 (mod 2) 

Oxi + 1x2 + 1x3 + 1x4 = 0 (mod 2)} 

Figure 1 gives the layered digraph G for this relaxation. 




Fig. 1. The layered digraph, G, for Example 2. 



Every s-t path in G represents a feasible 0-1 solution to S22. What is more 
important is that every feasible solution to FP is also a s-t path in G. But this 
relationship is not reversible. That is, an s-t path in G may not be feasible for IP. 
Also, G may contain several arcs which do not belong to any s-t path. Such arcs 
can be easily identified and discarded as follows: Among the outgoing arcs from 
nodes in layer n — 1, we only keep those arcs which reach the node n** 
and delete all other arcs. This may introduce paths which terminate at layer 
n — 1. Hence, going through the nodes in layer n — 2, we discard all outgoing 
arcs on paths which terminate at layer n — 1, and so on. It can be easily seen 
that performing a “backward sweep” in G in this manner, in time 0{N), we 
get a new graph G which consists of only the arcs in solutions to the group 
relaxation. For the graph G in Figure 1, the graph G is shown in Figure 2. 
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Fig. 2. The solution digraph, G , for Example 2. 



For each s-t path in G' , we check whether it corresponds to a feasible solution of 
FP. Thus, a depth-first-search backtracking algorithm solves FP in time 0{pmn) 
where p is the number of s-t paths in G' . Two issues are crucial with regard to 
the complexity of this approach: the size of p and the size of the digraph G . A 
simple estimate shows that the expected value of p is when the data 

Gij are uniformly and independently distributed between 0 and D — 1, the Si’s 
divide D and the bi’s are uniformly and independently distributed in the range 
-D-|-X]j=i to [i(— aij)J -I-Z9 — 1. On the other hand, the size 

of G is of the order n x Si... x Sm- The two issues, namely the size of the digraph 
G and the number of solutions to the group relaxation, are complementary to 
each other. As the size of G increases, the number of solutions to the group 
relaxation decreases. The best choice is when these two sizes are equal, that is 
i5i X (52 X . . . (5m « Then, under the above probabilistic assumptions, the 
expected complexity of the group relaxation approach is 0(^/n29). 



3.2 An 0(n2^”/^)) Sorting-Based Procedure 

Laurence Wolsey [21] suggested a solution approach based on sorting. We first 
describe this procedure for the subset sum feasibility problem (SSFP). 

SSFP: Given an integer n, an integral n-vector a = (ai, ...,a„) and an integer 
6, is {x e {0, 1}" : ajXj = b}^(t>l 

The procedure SSP(n,o,6) described below decomposes the problem into two 
subproblems each of size It then sorts the 2 s' subset sums for both the sub- 
problems and traverses the two lists containing these subset sums in opposite 
directions to find two values that add up to b. WLOG, we assume n is even. 
SSP(n,g,6) 



1. Let p = = {ai, ..., Gp} and v'^ = {op+i, ..., a„}. Gompute SS^ and 

the arrays of subset sums of the power sets of and respectively. 

2. Sort SS^ and SS"^ in ascending order. 
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3. Let k = 2P, i = 1 and j = k. 

do while ((i < k) OR (j > 1)) { 

If = b) then quit. (Answer to SSFP is “yes”) 

else if < 6) then set t i + 1 

else set j = j — 1 }• 

4. Answer to SSFP is “No” . 

The complexity of this procedure is dominated by the sorting step and there- 
fore it is 0{n2^). This procedure can be extended to answer FP as follows: For 
1 = 1, multiply the constraint by and add all the constraints 

to obtain a single surrogate constraint 

n m m 

j—l 1—1 i—1 

Let a = b = X)™ Call 

SSP(n,a,6). 

Note: As for dynamic programming, this technique is suitable only for problems 
with a few constraints. If is too large, a smaller number can be used 

but then the one-to-one correspondence between the solutions of FP and the 
solutions of the surrogate constraint is lost. In this case note that, if for some 
i and j we have -I- = 6, the corresponding 0-1 solution may not 

satisfy FP. So, in order to solve FP, we need to find all the pairs i, j such that 

3.3 Computational Experience 

See Table 3. For the group relaxation, we use Si = 8, for all i = for 

the 5 instances of size 3 x 20 and Si = 16, for all i = 1, ...,m, for the remaining 
instances. Times reported refer to seconds on an HP720 Apollo desktop worksta- 
tion with 64 megabytes of memory. The sorting-based procedure dominates the 
group relaxation for instances up to 40 variables. The group relaxation could 
solve all the instances but is very expensive for larger problems. Within each 
problem setting, there is very little difference in the amount of time taken by 
the group relaxation. This is not surprising since, within each problem setting, 
the number of solutions to the group relaxation is about the same. A similar 
observation holds for the subset sum approach as well as dynamic programming. 

4 Conclusions 

In this paper, we consider a class of 0,1 programs with n variables, where n is a 
multiple of 10. As noted in the previous section, although the group relaxation is 
able to solve problem instances with up to 50 variables, its running time increases 
rapidly. Its space complexity and expected time complexity can be estimated to 
be 0(i/n2's'). It is an open question to find an algorithm with expected time 
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Table 3. Computing time comparisons. 



Problem 
size (m x n) 


B & B 


B & C 


DP 


Basis 


Group 


Subset Sort 


3 X 20 


11.76 


213.62 


0.93 


300.42 


0.13 


0.01 


3 X 20 


13.04 


125.47 


1.28 


209.79 


0.11 


0.01 


3 X 20 


13.16 


208.76 


0.94 


212.03 


0.11 


0.01 


3 X 20 


13.64 


154.71 


1.31 


277.41 


0.12 


0.03 


3 X 20 


11.21 


190.81 


1.11 


197.37 


0.12 


0.03 


4 X 30 


1542.76 


*** 


20.32 


*** 


18.06 


0.99 


4 X 30 


1706.84 


*** 


20.37 


*** 


17.83 


1.03 


4 X 30 


2722.52 


*** 


20.31 


*** 


17.92 


0.98 


4 X 30 


2408.84 


*** 


18.43 


*** 


17.93 


1.00 


4 X 30 


2977.28 


*** 


18.94 


*** 


18.04 


1.01 


5 X 40 


*** 


*** 


-f-f-f 


*** 


1556.43 


46.10 


5 X 40 


*** 


*** 


-f-f-f 


*** 


1562.66 


46.18 


5 X 40 


*** 


*** 


-f-f-f 


*** 


1604.02 


45.61 


5 X 40 


*** 


*** 


-f-f-f 


*** 


1548.55 


46.20 


5 X 40 


*** 


*** 


-f-f-f 


*** 


1606.24 


45.51 


6 X 50 






-f-f-f 




26425.01 


-f-f-f 


6 X 50 


*** 


*** 


-f-f-f 


*** 


26591.28 


-f-f-f 


6 X 50 


*** 


*** 


-f-f-f 


*** 


26454.30 


-f-f-f 


6 X 50 


*** 


*** 


-f-f-f 


*** 


27379.04 


-f-f-f 


6 X 50 


*** 


*** 


-f-f-f 


*** 


27316.98 


-f-f-f 



** Time limit (54000 seconds) exceeded. 
H — h+ Space limit exceeded. 



complexity better than 0(2'2 ). Our computational experience indicates that the 
standard approaches to integer programming are not well suited for this class 
of problems. As such, we would like to present these small-sized 0-1 integer 
programs as a challenge for the research community and we hope that they may 
lead to new algorithmic ideas. 
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Abstract. A cactus tree is a simple data structure that represents all 
minimum cuts of a weighted graph in linear space. We describe the first 
algorithm that can build a cactus tree from the asymptotically fastest 
deterministic algorithm that finds all minimum cuts in a weighted graph 
— the Hao-Orlin minimum cut algorithm. This improves the time to 
construct the cactus in graphs with n vertices and m edges from 0(n®) 
to 0(nm log n^/m). 



1 Introduction 

A minimum cut of a graph is a non-empty, proper subset of the vertices such 
that the sum of the weights of the edges with only one endpoint in the set is 
minimized. An undirected graph on n vertices can contain up to ( 2 ) minimum 
cuts [7,4]. For many applications, it is useful to know many, or all minimum cuts 
of a graph, for instance, in separation algorithms for cutting plane approaches 
to solving integer programs [5,8,2], and in solving network augmentation and 
reliability problems [3] . Many other applications of minimum cuts are described 
in [19,1]. 

In 1976, Dinits, Karzanov, and Lomonosov [7] published a description of a 
very simple data structure called a cactus that represents all minimum cuts of 
an undirected graph in linear space. This is notable considering the number of 
possible minimum cuts in a graph, and the space needed to store one minimum 
cut. Ten years later, Karzanov and Timofeev [16] outlined the first algorithm 
to build such a structure for an unweighted graph. Although their outline lacks 
some important details, it does provide a framework for constructing correct 
algorithms [20]. In addition, it can be extended to weighted graphs. 

* The full paper is available at http://www.ieor.columbia.edu/~lisa/papers.html. 
This work supported in part by ONR through an NDSEG fellowship, by AASERT 
through grant N00014-95- 1-0985, by an American Association of University Women 
Educational Foundation Selected Professions Fellowship, by the NSF PYI award of 
Eva Tardos, and by NSF through grant DMS 9505155. 
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The earliest algorithm for finding all minimum cuts in a graph uses maximum 
flows to compute minimum (s,t)-cuts for all pairs of vertices (s,t). Gomory 
and Hu [12] show how to do this with only n (s,t)-flow computations. The 
fastest known deterministic maximum flow algorithm, designed by Goldberg 
and Tarjan [11], runs in 0{nmlog(n^ /m)) time. Hao and Orlin [13] show how 
a minimum cut can be computed in the same asymptotic time as one run of 
the Goldberg-Tarjan algorithm. Using ideas of Picard and Queyranne [18], the 
Hao-Orlin algorithm can be easily modified to produce all minimum cuts. Karger 
and Stein [15] describe a randomized algorithm that finds all minimum cuts in 
0{n^ log^ n) time. Recently, Karger has devised a new, randomized algorithm 
that finds all minimum cuts in 0{n^ logn) time. 

The Karzanov-Timofeev outline breaks neatly into two parts: generating a 
sequence of all minimum cuts, and constructing the cactus from this sequence. 
This two-phase approach applies to both unweighted and weighted graphs. It is 
known that the second phase can be performed in O(n^) time for both weighted 
and unweighted graphs [16,17,20]. The bottleneck for the weighted case is the 
first phase. Thus the efficiency of an algorithm to build a cactus tree depends 
on the efficiency of the algorithm used to generate an appropriate sequence of 
all minimum cuts of an undirected graph. The Karzanov-Timofeev framework 
requires a sequence of all minimum cuts found by generating all (s, t) minimum 
cuts for a given s and a specific sequence of t’s. The Hao-Orlin algorithm also 
generates minimum cuts by finding all (s, t) minimum cuts for a given s and a 
sequence of t’s. However, the order of the t’s cannot be predetermined in the 
Hao-Orlin algorithm, and it may not be an appropriate order for the Karzanov- 
Timofeev framework. 

All minimum cuts, produced by any algorithm, can be sequenced appro- 
priately for constructing a cactus tree in 0{nf) time. This is not hard to do 
considering there are at most (”) minimum cuts, and each can be stored in 
0{n) space. The main result of this paper is an algorithm that constructs an 
appropriate sequence of minimum cuts within the same time as the asymptot- 
ically fastest, deterministic algorithm that finds all minimum cuts in weighted 
graphs, improving the deterministic time to construct the cactus in graphs with 
n vertices and m edges from 0{nf) to 0(nmlog(n^/m)). 

Why build a cactus tree? Any algorithm that is capable of computing all 
minimum cuts of a graph implicitly produces a data structure that represents 
all the cuts. For instance, Karger’s randomized algorithm builds a data struc- 
ture that represents all minimum cuts in 0{k + nlogn) space, where k is the 
number of minimum cuts. The cactus tree is special because it is simple. The 
size of a cactus tree is linear in the number of vertices in the original graph, 
and any cut can be retrieved in time linearly proportional to the size of the cut. 
In addition, the cactus displays explicitly all nesting and intersection relations 
among minimum cuts. This, as well as the compactness of a cactus, is unique 
among representations of minimum cuts of weighted graphs. 

Karzanov and Timofeev [16] propose a method of constructing a cactus tree 
using the chain representation of minimum cuts (described in Section 2.4). Their 
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algorithm is lacking details, some of which are provided by Naor and Vazirani [17] 
in a paper that describes a parallel cactus algorithm. Both of these algorithms 
are not correct since they construct cacti that may not contain all minimum cuts 
of the original graph. De Vitis [20] provides a complete and correct description 
of an algorithm to construct a cactus, based on the ideas in [16] and [17]. Karger 
and Stein [15] give a randomized algorithm for constructing the chain represen- 
tation in O(n^log^n) time. Bencziir [3] outlines another approach to build a 
cactus without using the chain representation. However, it is not correct since 
it constructs cacti that may not contain all minimum cuts of the original graph. 
Gabow [9,10] describes a linear-sized representation of minimum cuts of an un- 
weighted graph and gives an 0{m + A^nlog(n/A)) time algorithm to construct 
this representation, where A is the number of edges in the minimum cut. 

This paper describes how to modify the output of the Hao-Orlin algorithm 
and rearrange the minimum cuts into an order suitable for a cactus algorithm 
based on Karzanov-Timofeev framework. The algorithm presented here runs in 
0{nm + n^logn) time — at least as fast as the Hao-Orlin algorithm. Since an 
algorithm based on the Karzanov-Timofeev outline requires O(n^) time, plus 
the time to find and sort all minimum cuts, this leads to the fastest known 
deterministic algorithm to construct a cactus of a weighted graph, and the fastest 
algorithm for sparse graphs. 

2 Preliminaries 

2.1 Definitions and Notation 

We assume the reader is familiar with standard graph terminology as found in [6] . 
A graph G = (V,E) is defined by a set of vertices V, with \V\ = n and a set 
of edges E C KxK, with \E\ = m. A weighted graph also has a weight function 
on the edges, w : E ^ iR. For the purposes of this paper, we assume w is non- 
negative. A cut in a graph is a non-empty, proper subset of the vertices. The 
weight of a cut C is the sum of weights of edges with exactly one endpoint in the 
set. The weight of edges with one endpoint in each of two disjoint vertex sets S 
and T is denoted as w{S, T). A minimum cutis a cut C with w{C, C) < w{C' , C') 
for all cuts C' . An {S,T)-cut is a cut that contains S and is disjoint from T. A 
(S', T) minimum cut is a minimum cut that separates S and T. Note that if the 
value of the minimum (S, T)-cut is greater than the value of the minimum cut, 
there are no (S, T) minimum cuts. 



2.2 The Structure of Minimum Cuts of a Graph 

A graph can have at most ( 2 ) minimum cuts. This is achieved by a simple cycle 
on n vertices: there are (”) choices of pairs of edges broken by a minimum cut. 
This is also an upper bound as shown in [7,4] or, with a simpler proof, in [14]. 

Let A be the value of a minimum cut in graph G = (V, E). The proof of the 
next lemma is in the full paper. 
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Lemma 1. If Si and S 2 are minimum cuts such that none of A = Si n S 2 , 
B = S'i\S' 2 , C = S 2 \S\, or D — S\yj S 2 is empty, then 

1. A, B, C, and D are minimum cuts, 

2. w{A,D) = w{B,C) = 0, 

3. w{A, B) = w{B, D) = w{D, C) = w{C, A) = A/2. 

Two cuts (S'!, S'!) and {S 2 , S 2 ) that meet the conditions of the above lemma are 
called crossing cuts. A fundamental lemma further explaining the structure of 
minimum cuts in a graph is the Circular Partition Lemma. This lemma is proven 
by Bixby [4] and Dinits, et al. [7], with alternate proofs in [3,20]. 

Definition 2. A circular partition is a partition ofV into k > 3 disjoint subsets 
Vi, V 2 , ■ • ■ ,Vk, such that 

— w{Vi, Vj) = A/2 when i — j = 1 mod k and equals zero otherwise. 

— For l<a<b<k, A = uJz/Vi is a minimum cut, and if B is a minimum 
cut such that B or B is not of this form, then B or B is contained in some 
V^. 



Lemma 3 (Bixby [4]; Dinits, Karzanov, and Lomonosov [7]). If G con- 
tains crossing cuts, then G has a circular partition. 

The proof of this lemma is long and technical, and it is omitted here. The 
proof uses Lemma 1 to argue that any minimum cut not represented in a circular 
partition must be contained in one of the sets of the partition. 

It may be that G has more than one circular partition. Let P := {Vi, . . . ,14} 
and Q := {Ui, . . . ,Ui} he two distinct circular partitions of V. These partitions 
are compatible if there is a unique i and j such that Ur C Vi for all r ^ j and 
Vs C Uj for all s ^ i. Proofs of the next two statements are in the full paper. 

Corollary 4. Any two circular partitions of a graph G are compatible. 

A set of sets is called laminar if every pair sets is either disjoint or one set is 
contained in the other. Laminar sets that do not include the empty set can be 
represented by a tree where the root node corresponds to the entire underlying 
set, and the leaves correspond to the sets that contain no other sets. The parent 
of a set A is the smallest set properly containing A. Since the total number of 
nodes in a tree with n leaves is less than 2n, the size of the largest set of laminar 
sets of n objects is at most 2n — 1. 

Lemma 5. There are at most n — 2 distinct circular partitions of a graph on n 
vertices. 
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Fig. 1. Cactus of a graph. 



2.3 The Cactus Representation 

If G has no circular partitions, then G has no crossing cuts. In this case, by 
considering the smaller side of every cut, the minimum cuts of G are laminar 
sets. Hence we can represent the minimum cuts of G by the tree used to represent 
laminar sets mentioned in the previous section. There is a 1-1 correspondence 
between the minimum cuts of G and the minimum cuts of this tree. 

In order to represent crossing minimum cuts, it is necessary to add cycles to 
this tree representing circular partitions of G. A cactus of a graph is a data struc- 
ture that succinctly represents all minimum cuts of G by succinctly representing 
all circular partitions of G. 

A cactus is a tree-like graph that may contain cycles as long as no two cycles 
share more than one vertex: every edge in the cactus lies on at most one cycle. 
Figure 1 contains an example of a cactus. A cactus of a graph G, denoted 7i(G), 
has in addition a mapping tt that maps vertices of G to vertices of 7i(G), and a 
weight assigned to every edge. To distinguish the vertices of a graph from those 
of its cactus, we refer to the vertices of a cactus as nodes. If A is the value of 
the minimum cut in G, then each cycle edge of Ti{G) has weight A/2, and each 
path edge has weight A. The mapping tt is such that every minimal cut M of 
'H(G) corresponds to a minimum cut of G, and every minimum cut in 

G equals for some minimal cut M of Ti{G). If = 0, we say that 

i is an empty node. Figure 1 contains a graph and a corresponding cactus. The 
proofs of the next theorem and corollary are in the full paper. 

Theorem 6 (Dinits, Karzanov, and Lomonosov [7]). Every weighted graph 
has a cactus. 



Corollary 7. Every weighted graph on n vertices has a cactus with no more 
than 2n vertices. 

As defined here, and in [7], a graph does not necessarily have a unique cactus. 
For instance, a cycle on three nodes can also be represented by a cactus that 
is a star: an empty node of degree three, and three nodes of degree one each 
containing a distinct vertex. There are many rules that could make the definition 
unique [17,20]. We will follow De Vitis [20] in the definition of a canonical cactus. 
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Let z be a node on cycle Y of H{G). Let Cf be the component containing z 
formed by removing the edges adjacent to z on Y, and let We 

call i trivial if VY' — removing CY and making the neighbors of z on y 

neighbors in Y, we can assume Ti{G) has no trivial nodes. We also assume that 
Ti(G') has no empty, 3- way cut-nodes: an empty cut-node whose removal breaks 
'H(G) into exactly three components can be replaced by a 3-cycle. Finally, we 
assume there are no empty nodes z in Ti{G) of degree three or less: either z is 
a 3-way cut-node, and handled as above, or it is a 2-way cut-node, and hence 
we can contract an incident cut-edge and still maintain a representation of all 
minimum cuts of G in Ti.{G). A canonical cactus of a graph is a cactus with no 
trivial nodes, no empty 3-way cut-nodes, and no empty nodes with degree < 3. 
The following theorem is not hard to prove. 

Theorem 8 (De Vitis [20]). Every weighted graph has a unique canonical 
cactus. 

Henceforth, cactus will be used to mean canonical cactus. In this canonical 
representation, every circular partition of G corresponds to a cycle of 7i(G) and 
every cycle of 7f(G) represents a circular partition of G. For cycle Y, the VY' 
are the sets of the corresponding circular partition. 



2.4 The Chain Representation of Minimum Cuts 

Karzanov and Timofeev [16] make an important observation about the struc- 
ture of minimum cuts, which they use to build the cactus: if two vertices s and 
t are adjacent in the graph G, then the minimum cuts that separate these ver- 
tices are nested sets. This motivates assigning vertices of G an adjacency order 
{zzi, . . . ,Vn} so that Vi+i is adjacent to a vertex in Vi := {fi, . . . ,Vi}. Let Mi 
be the set of minimum cuts that contain Vi but not fz+i. We will refer to such 
minimum cuts as (Vj, fi+i) minimum cuts. The following lemma summarizes the 
observation of [16]. 

Lemma 9. If the vertices in G are adjacency ordered, then all cuts in Mi are 
non-crossing. 

This implies that the cuts in Mi form a nested chain and can be represented by 
a path Pi of sets that partition C : let Ai C A2 C • • • C A/ be the minimum cuts 
separating Vj and zzi+i. The first set X\ of the path is Ai, and each additional 
set Xr = Ar\Ar-i, with the final set equal to V\Ai. Each link of the 

path represents the minimum cut Ar that would result if the path were broken 
between sets Xr and Xr+i. 

Note that, for any ordering of vertices, the Mi form a partition of the min- 
imum cuts of G: Vi and Vi+\ are the smallest, consecutively indexed pair of 
vertices that are separated by a cut, if and only if the cut is in Mi. The set of 
Pi for all 1 < z < rz is called the chain representation of minimum cuts. For this 
reason, we will refer to these paths as chains. 
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2.5 Prom the Chain Representation to the Cactus 

The algorithm outlined by Karzanov and Timofeev [16], refined by Naor and 
Vazirani [17], and corrected by De Vitis [20], builds the cactus representation 
from the chain representation of minimum cuts. For completeness, an outline 
of the algorithm is presented here. Let Gi represent the graph G with nodes 
in Vi contracted into one node. Let Gr be the smallest such graph that has a 
minimum cut of value A (r is the largest index of such a graph) . The algorithm 
starts with the cactus for Gr, a pair of vertices incident to one edge. It then 
builds the cactus for Gi-i from Gi using the following observation that follows 
from the fact that cuts in Mi are non-crossing. 

Corollary 10. Let the vertices in G have an adjacency ordering. For each non- 
empty Mi, there is a path in the cactus of Gi that shares at most one edge with 
every cycle, and such that all cuts in Mi cut through this path. 

By contracting the path described in the lemma into a single node, all cuts 
in Mi are removed from 7i{Gi), and no other minimum cuts are removed. Thus 
the resulting structure is H{Gi+i). 

Working in the opposite direction, in iteration i, the cactus algorithm replaces 
the node u in H{Gi+i) that contains vertices {ui, . . . , Vi} with the path Pi that 
represents the chain of cuts Ci in Mi. Edges that were incident to u are then 
joined to an appropriate node in Pi. The mapping of vertices of Gi to vertices 
of the cactus are updated. Lastly, post-processing forms the canonical cactus of 
Gi. More precisely, the algorithm proceeds as follows. 

(i) If Gi = Xi, . . . , Xk then replace u hy k new nodes u\, . . . , Uk with edges 
{uj,Uj+i) for 1 < j < A:. 

(ii) For any tree or cycle edge {u, w) in 7f(Gi+i), let IF yf 0 be the set of vertices 
in w, or if w is an empty node, the vertices in any non-empty node w' 
reachable from w by some path of edges disjoint from a cycle containing 
(u,w). Find the subset Xj such that W C Xj and connect w to Uj. 

(iii) Let U be the set of vertices mapped to H{Gi+i). Assign to Uj the set Xjr\U. 
All other mappings remain unchanged. 

(iv) Remove all empty nodes of degree < 2 and all empty 2-way cut nodes by 
contracting an adjacent tree edge. Replace all empty 3-way cut-nodes with 
3-cycles. 

The correctness of this procedure is easy to prove by induction using the 
previous observations and the following two lemmas. 

Lemma 11. If {u,w) is a tree edge ofH{Gi+i) and T C V is the set of vertices 
in the subtree attached to u by (u,w), then T C Xj for some j. 

Lemma 12. Let (u,w) be a cycle edge of T-L{Gi+i) that lies on cycle Y, and 
let Ti, T 2 , . . . ,Tr be the circular partition represented by Y with Vi C T\. Then 
either UI =2 some j, or there are indices a and b such that T 2 = Xa, 

T3 = Xa+i, . . . , Tr = Xb. 
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Clearly, operations (i)-(iii) of the above procedure can be performed in linear 
time, thus requiring O(n^) time to build the entire cactus from the chain rep- 
resentation of minimum cuts. Operation (iv) requires constant time per update. 
Each contraction removes a node from the cactus, implying no more than 0{n) 
contractions per iteration. Each cycle created is never destroyed, and since the 
total number of circular partitions of a graph is at most n, there can not be more 
than n of these operations over the course of the algorithm. 

Theorem 13. The canonical cactus of a graph can he constructed from the chain 
representation of all minimum cuts of the graph in 0{n^) time. 



2.6 Finding All Minimum Cuts in a Weighted Graph 

To find all minimum cuts, we make minor modifications to the minimum cut 
algorithm of Hao and Orlin [13]. The Hao-Orlin algorithm is based on Goldberg 
and Tarjan’s preflow-push algorithm for finding a maximum flow [11]. It starts 
by designating a source vertex, ui, assigning it a label of n, and labeling all other 
vertices 0. For the first iteration, it selects a sink vertex, U 2 , and sends as much 
flow as possible from the source to the sink (using the weights as arc capacities), 
increasing the labels on some of the vertices in the process. The algorithm then 
repeats this procedure n — 2 times, starting each time by contracting the current 
sink into the source, and designating a node with the lowest label as the new 
sink. The overall minimum cut is the minimum cut found in the iteration with 
the smallest maximum flow value, where the value of a flow is the amount of 
flow that enters the sink. 

To And all minimum cuts with this algorithm, we use the following result. 
Deflne a closure of a set A of vertices in a directed graph to be the smallest set 
containing A that has no arcs leaving the set. 

Lemma 14. (Picard and Queyranne [18]) There is a 1-1 correspondence 
between the minimum {s,t)~cuts of a graph and the closed vertex sets in the 
residual graph of a maximum {s,t)-flow. 

Let Ui be the set of vertices in the source at iteration i of the Hao-Orlin routine, 
l<t<n— l.Ifa complete maximum flow is computed at iteration i, Lemma 14 
implies that the set of (Ui,Ui+i) minimum cuts equals the set of closures in the 
residual graph of this flow. This representation can be made more compact by 
contracting strongly connected components of the residual graph, creating a 
directed, acyclic graph (DAG). We refer to the graph on the same vertex set, 
but with the direction of all arcs reversed (so that they direct from Ui to Ui+i), 
as the DAG representation of (Ui,Ui+i) minimum cuts. 

The Hao-Orlin routine does not necessarily compute an entire maximum flow 
at iteration i. Instead it computes a flow of value > A that obeys the capacity 
constraint for each edge, but leaves excess flow at some nodes contained in a dor- 
mant region that includes the source. Let Ri denote the vertices in the dormant 
region at the end of the iteration of the Hao-Orlin algorithm. The following 
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lemma follows easily from the results in [13]. Together with the above observa- 
tions, this implies that at the end of iteration i of the Hao-Orlin algorithm, we 
can build a linear-sized DAG representation of all minimum cuts. 

Lemma 15. Ri is contained in the source component of the DAG representation 
of (Ui,Ui+i) minimum cuts. 

The second and more serious problem with the Hao-Orlin algorithm is that 
the ordering of the vertices implied by the selection of sinks in the algorithm 
may not be an adjacency ordering as required for the construction phase of the 
Karzanov-Timofeev framework. This means that some of the DAGs may not 
correspond to directed paths. We address this problem in the next section. 

3 The Algorithm 

In this section, we discuss how to transform the DAG representation of all min- 
imum cuts into the chains that can be used to construct a cactus representation 
of all minimum cuts. To do this, we use an intermediate structure called an 
(S', T)-cactus. 

3.1 (S,T)-Cacti 

An (S, T)-cactus is a cactus representation of all minimum cuts separating vertex 
sets S and T. Note that an (S, T)-cactus is not necessarily a structure represent- 
ing minimum (S, T)-cuts, but overall minimum cuts that separate S and T. If the 
minimum (S, T)-cut has value greater than the minimum cut, then the (S, T)- 
cactus is a single node containing all vertices. The proof of the following lemma 
is in the full paper. 

Lemma 16. An {S,T)-cactus is a path of edges and cycles. 

Each cycle in an (S, T)-cactus has two nodes that are adjacent to the rest of the 
path, one on the source side, one on the sink side. The other nodes on the cycle 
form two paths between these two end nodes. We will call these cycle-paths. 
Thus each cycle in the {S, T)-cactus can be described by a source-side node, a 
sink-side node, and two cycle-paths of nodes connecting them. The length of a 
cycle-path refers to the number of nodes on the cycle-path. A zero length cycle- 
path implies that it consists of just one arc joining the source-side node to the 
sink-side node. 

Without loss of generality, we assume that no cycle of an (S', T)-cacti has a 
zero length cycle-path. Any cycle of an (S, T)-cactus that contains a zero length 
cycle-path can be transformed into a path by deleting the arc in the zero length 
cycle-path. This operation does not delete any (S, T)-cut represented by the 
(S, T)-cactus. 

Note that Lemma 9 implies that an (S, T)-cactus of a chain is a path without 
cycles. A first step in building these chains needed for the cactus algorithm is 
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constructing the n — 1 {S, T)-cacti of the n — 1 DAG’s output by the Hao-Orlin 
algorithm. From these (S, T)-cacti, we can then construct chains needed for the 
cactus algorithm. 

Benczur [3] outlines how to build an {S, T)-cactus from the DAG representa- 
tion of all minimum cuts separating S and T. The algorithm is a topological sort 
on the DAG, with some post-processing that can also be performed in O(m-l-n) 
time. For completeness, we describe the algorithm here. 

For two different cycles of the {S, T)-cactus, all the nodes on the cycle closer 
to the source must precede all the nodes on the further cycle in any topological 
ordering. Thus a topological sort sequences the nodes of the DAG so that nodes 
on a cycle in the (S', T)-cactus appear consecutively. It remains to identify the 
start and end of a cycle, and to place the nodes of a cycle on the proper chain. 

For a node i on cycle F of a cactus H{G), define Cf to be the component 
containing i formed by removing the edges adjacent to i on Y , and define = 
If = 0 for some Y, we can remove and make the neighbors of i 
on Y neighbors in Y. Thus we assume 'H(G) has no cycle nodes i with = 0. 
The proof of the following lemma is contained in the full paper. 

Lemma 17. Let Y he a cycle on three or more nodes of cactus ’H(G) then 

1. The sum of the costs on edges between any two vertex sets and of 
adjacent nodes i and j on cycle Y equals A/2. 

2. There are no edges between vertex sets and of nonadjacent nodes i 
and h on Y. 

Since we can construct an (S, T)-cactus from the cactus of the complete graph 
by contracting nodes, this lemma also applies to (S, T)-cacti: on a cycle of an 
(S, T)-cactus, only adjacent nodes share edges. 

We can now walk through the topological sort of a DAG produced by the 
Hao-Orlin algorithm to create the corresponding (S', T)-cactus. As long as the 
weight of the edges between two consecutive components in the topological order 
equals A, the value of a minimum cut, the (S, T)-cactus is a path. The first time 
we encounter a value < A, the path divides into two cycle-paths of a cycle. 
We stay with one cycle-path as long as the weight of the edges between two 
consecutive components = A/2. If it is zero, we jump to the other cycle-path. If 
it is anything in between A/2 and zero, we end the cycle with an empty vertex, 
and start a new cycle. If it is A, we end the cycle with the current vertex, and 
resume a path. At the end of the topologically ordered list, we end the cycle (if 
started) with the last component. 

Lemma 18. (Benczur [3]) A DAG representing strongly connected compo- 
nents of the residual graph of a maximum (s, t)-flow can he transformed into an 
{s,f)-cactus in 0{m-\-n) time. □ 

3.2 Constructing the Chains 

In this section we describe how to efficiently build chains Pi of minimum cuts 
that can be used to construct a cactus representation of all minimum cuts from 
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the Uj)-cacti, I < j < n. Let Dj be the {Uj-i , Uj)-ca,ctus for I < j < n. 

We start by fixing an adjacency ordering {ui,... ,Vn} with vi = ui and let 
a : V ^ V he the permutation a{i) = j if Vi = Uj. 

The algorithm we present below builds each chain Pi, 1 < i < n, one at 
a time by reading cuts off of the Dj. The basic idea is to use the following 
fact established below: If any two cuts from Dj are such that one is completely 
contained within the other, then the index of the chain to which the first belongs 
is no more than the index of the chain to which the second belongs. Thus, all 
the minimum cuts belonging in Pi that are contained in one Dj are consecutive; 
and, as we walk through the cuts from Uj-\ to Uj in Dj, the index of the chain 
to which a cut belongs is non-decreasing. The validity of these statements is 
established in the next two subsections. 

The Code 
(s,t)-CactiToChains (v) 

0. For z = 2 to n, 

0 . Pi = 0 . 

0. For j = 2 to n — 1, 

1. riij := the node in Dj that contains Vi. 

2. If riij is not the source node, 

3. Identify the path of nodes from source to predecessor of riij in Dj. 

4. If Pi = 0, let the source node of Dj be the source node of Pi. 

5. Else, add vertices in the source but not in Pi to the last node of Pi. 

6. Append the remaining path to Pi, skipping empty nodes; 

7. and, in Dj, contract this path into the source node. 

8. If riij is on a cycle in Dj, 

9. Append to Pi the chain of the cycle that does not contain riij. 

10. Contract riij into the source node of Dj. 

11. Add a component to Pi that contains only Uj. 

12. If Pi yf 0, add the vertices not in Pi to the last node of Pi. 

Correctness: In this section we prove that each Pi constructed by the above 
algorithm represents all cuts in Mi. Define a phase to be one pass through the 
outer while loop. Pi is complete when it contains all vertices. A node of Pi is 
complete when either a new node of Pi is started, or Pi is complete. For example, 
a node started at Step 11 is not complete until after the next execution of Step 
5 or Step 12. A cut in Dj is obtained by either removing an arc on a path or a 
pair of arcs on two different cycle-paths of a cycle. The one or two nodes that 
are endpoints of these arcs on the source side are said to represent this cut. 

Theorem 19. Each chain Pi constructed by the algorithm contains precisely the 
cuts in Mi. 

The proof of this theorem establishes the correctness of the algorithm and 
depends upon the following lemmas. 
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source sink 




Fig. 2. The (s,t)-cactus Dj, updated for iteration i. The black nodes represent 
the possible locations of rnj. 



Lemma 20. At the end of phase i, all vertices Vk,k < i are contained in the 
source node of every remaining Dj. 

Proof. Proof is by induction. At start, all sources contain vertex v\ = u\. Assume 
vertices Vk, k < i are contained in the sources of the remaining Dj at the start 
of phase i. For each j, if is not the source node, then is contracted into 
the source in Step 10. □ 

Lemma 21. In phase i, for all j, Uij either lies on the path in Dj from the 
source to the first node of the cycle closest to the source, or is one of the two 
nodes on the cycle adjacent to this path, as in Figure 2. 

Proof. Figure 2 shows the possible locations of in Dj as specified in the 
lemma. We show that if there is a cycle between and the source Sj , then we 
can find crossing cuts in Mi, contradicting our choice of the ordering of the Vi. 
If Uij is not in one of the locations stipulated in the lemma, then some cycle 
contains at least one node that either precedes, or is unrelated to n^, on both 
of its cycle-paths. (Recall that we are assuming that every cycle-path has length 
greater than zero.) Thus there are two crossing minimum cuts that separate the 
source of Dj from Vi. Lemma 20 implies that Vi_i is contained in the source, 
and hence these are cuts in Mi, contradicting Lemma 1. □ 

Lemma 22. The algorithm constructs chains that contain every vertex exactly 
once. 

Proof. By construction, each Pi is a chain that contains every vertex. Suppose, 
in iteration j, we are about to add a vertex Ur to the chain Pi. If Ur is in the 
source component of Dj, we explicitly check before we add Ur. Otherwise, r > j 
and there is some cut that separates vertices in Uj-i and Vi-i from vertices Ur 
and Uj. All cuts added to Pi before this iteration come from a Dk with k < j, and 
thus contain a strict subset of vertices in Uj-\. By Lemma 9, the cuts separating 
Vi-i from Vi are nested, and so none of the previous cuts contain Ur in the source 
side. Thus we add Ur to Pi exactly once. □ 



Lemma 23. Each complete node added to Pi represents the same cut that is 
represented by the node in the current Dj. 
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Proof. Proof is by induction on j . The first node added to Pi is the source node 
of some Dj and clearly both these cuts are the same. Now assume that up until 
this last node, all previous nodes added to Pi represented the corresponding cut 
of the relevant D^, k < j. Adding the current node from Dj, we add or have 
added in previous steps to Pi all the vertices that are in the source side of this 
cut. We must argue that there are no additional vertices in any node preceding 
this one in Pi. Recall that the cuts in Mi are nested. Thus all previous nodes 
added to Pi represent cuts strictly contained in the current cut. Hence the node 
added to Pi represents the same cut this node represents in the current Dj. □ 

Lemma 24. Each complete node added to Pi, 1 < i < n, except the last, repre- 
sents a distinct cut in Mi. 

Proof. Since each node added to Pi contains some vertex, and no vertex is re- 
peated, each represents a distinct cut. When Pi is started. Lemma 20 implies 
that all vertices in Vi-\ are contained in the source. Vertex Vi is only added in 
Step 11, when j = a{i). This starts the final node of Pi since Vi = Uj is con- 
tained in the source nodes of all Dk, k > j. Hence all previous nodes added to 
Pi represent cuts that contain V-i but not Vi, and are thus cuts in Mi. □ 

Proof of Theorem 19. For a fixed cut C, let Vi be the vertex of least index in C. 
We show that this cut is represented in Pi. By the end of each phase k, k < i, the 
algorithm contracts all nodes on the path from the source to the node containing 
Vk. Since Vi-i is contained in the source side of C, this cut is not contracted 
in Dj at the start of phase i. If the cut C is represented in Dj by one node, it 
lies on the path of Dj from the expanded source to . Thus this node is added 
to Pi in either Step 5 or 6. Otherwise the cut is through the same cycle that 
contains . One node representing the cycle is the node immediately preceding 
Uij, currently source-side node of the cycle. The other node lies on the cycle-path 
not containing Uij. This cut is added to Pi in Step 9, when this latter node is 
added. □ 

Efficiency: Some care is needed to implement the above algorithm to complete 
within the same asymptotic time as the Hao-Orlin algorithm. Within the inner 
“for” loops, operations should not take more than 0(log n) time. We show, that 
with minor preprocessing and record keeping, all operations will take no more 
than logarithmic time per operation, or can be charged to some set of minimum 
cuts. 

Before starting the algorithm, we walk through each (s, t)-cactus from t to 
s, labeling every node with the smallest index of a vertex Vi contained in that 
node, and labeling every arc with the smallest index on the sink side of the arc. 
This takes linear time per (s, t)-cactus, for a total of 0{n^) time. 

We perform one more preprocessing step. For each vertex, we create an array 
of length n — 1. In the array for vertex Vi, entry j indicates if Vi is in the source 
component of Dj. This takes O(n^) time to initialize; and, through the course 
of the algorithm, it will take an additional 0{n^) time to maintain, since once a 
vertex is contracted into the source, it stays there. 
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Theorem 25. Algorithm (s,t)-CactiToChains runs in O(n^logn) time. 

Proof. In Step 1, we use the labeling of the arcs to walk through Dj starting from 
the source until we find Uij. By Lemmas 20 and 21, all these cuts are {Vi-i,Vi)~ 
cuts, and hence the arcs are labeled with Wj. The time to do this depends on the 
out degree of each node encountered in Dj (at most 2), and is charged to the 
minimum cuts we encounter. Note that if none of the arcs leaving the source is 
labeled with the current chain index i, then must be the source node. This 
takes care of Steps 1-3, and Step 8. 

Let’s put aside Steps 5 and 12 and concentrate on the remaining steps. In 
these, we are either adding nodes and vertices to Pi, or contracting nodes of Dj. 
This takes constant time per addition or contraction of vertex. Since each vertex 
is added at most once to Pi and contracted into the source of Dj at most once, 
over the course of the algorithm these steps take no more than O(n^) time. 

Finally, we consider Steps 5 and 12. In these, we need to determine the 
difference of two sets that are increasing over the course of the algorithm. To do 
this efficiently, we wait until the end of a phase to add the vertices that would 
have been added at Step 5. At the end of each phase, we make a list of the 
vertices that are missing from Pi. This can be done in linear time per phase by 
maintaining an array in each phase that indicates if a vertex has been included 
in Pi. For each of the vertices not in Pi, we then need to determine to which 
node of Pi it should be added. The nodes that are possible candidates are the 
incomplete nodes — any node created in Step 11. 

Vertex v is added to the node containing Uk in Step 5 if u is contained in the 
source of Dj, j > k but is not contained in the source of Dk, for iterations k and 
j where Uik and are not source nodes, and k is the highest index with these 
properties. Consider the Dj for which Uij is not the source node. The sources 
of these Dj are nested for any fixed i, by the observation that the cuts of Mi 
are nested. That is, once a vertex is contained in one of them, it is contained 
in the rest. During each phase, we maintain a array-list of the consecutive j for 
which Uij is not the source node. For each vertex, we can now binary search 
through these nested sources using the arrays described before the statement 
of the theorem to find the node of Pi that should contain v in O(logn) time. 
The time spent on this step over the course of the algorithm is then bounded by 
0(n^ logn): logarithmic time per vertex per phase. 

Note that we have added vertices to the sources of these Dj since we started 
the phase. All the vertices we added in Step 7 have also been added to Pi, so 
they are not among the vertices we need to place at the end (i.e. the vertices not 
in Pi). We do need to worry about the vertices added in Step 10, however. To 
avoid this problem, simply postpone the contractions described in Step 10 until 
after we have assigned all vertices to a node of □ 

Note that mnlogn^ /m is never smaller than logn. 

Corollary 26. The chain and the cactus representations of all minimum cuts 
of a weighted graph can both be constructed in 0{mnlogn^/m) time. □ 
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Abstract. We introduce a gain-scaling technique for the generalized 
maximum flow problem. Using this technique, we present three simple 
and intuitive polynomial-time combinatorial algorithms for the problem. 
Truemper’s augmenting path algorithm is one of the simplest combi- 
natorial algorithms for the problem, but runs in exponential-time. Our 
first algorithm is a polynomial-time variant of Truemper’s algorithm. 
Our second algorithm is an adaption of Goldberg and Tarjan’s preflow- 
push algorithm. It is the first polynomial-time preflow-push algorithm in 
generalized networks. Our third algorithm is a variant of the Fat-Path 
capacity-scaling algorithm. It is much simpler than Radzik’s variant and 
matches the best known complexity for the problem. We discuss practical 
improvements in implementation. 



1 Introduction 

In this paper we present new algorithms for the generalized maximum flow prob- 
lem, also known as the generalized circulation problem. In the traditional max- 
imum flow problem, the objective is to send as much flow through a network 
from one distinguished node called the source to another called the sink, subject 
to capacity and flow conservation constraints. In generalized networks, a fixed 
percentage of the flow is lost when it is sent along arc. Specifically, each arc 
(u, w) has an associated gain factor j{v, w). When g{v, w) units of flow enter arc 
(u, w) at node v then j{v, w)g{v, w) arrive at w. The gains factors can represent 
physical transformations due to evaporation, energy dissipation, breeding, theft, 
or interest rates. They can also represent transformations from one commodity 
to another as a result of manufacturing, blending, or currency exchange. They 
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may also represent arc failure probabilities. Many applications are described in 
[1,3,5], 

Since the generalized maximum flow problem is a special case of linear pro- 
gramming, it can be solved using simplex, ellipsoid, or interior-point methods. 
Many general purpose linear programming algorithms can be tailored for the 
problem. The network simplex method can handle generalized flows. Kapoor 
and Vaidya [16] showed how to speed up interior-point methods on network 
flow problems by exploiting the structured sparsity in the underlying constraint 
matrix. Murray [18] and Kamath and Palmon [15] designed different interior- 
point algorithms for the problem. We note that these simplex and interior-point 
methods can also solve the generalized minimum cost flow problem. 

The first combinatorial algorithms for the generalized maximum flow prob- 
lem were the augmenting path algorithms of Jewell [14] and Onaga [19] and 
exponential-time variants. Truemper [22] observed that the problem is closely 
related to the minimum cost flow problem, and that many of the early gen- 
eralized maximum flow algorithms were, in fact, analogs of pseudo-polynomial 
minimum cost flow algorithms. Goldberg, Plotkin and Tardos [7] designed the 
first two combinatorial polynomial-time algorithms for the problem: Fat-Path 
and MCF. The Fat-Path algorithm uses capacity-scaling and a subroutine that 
cancels flow-generating cycles. The MCF algorithm performs minimum cost flow 
computations. Radzik [20] modified the Fat-Path algorithm, by canceling only 
flow-generating cycle with sufficiently large gains. Goldfarb and Jin [12] modified 
the MCF algorithm by replacing the minimum cost flow subroutine with a sim- 
pler computation. Goldfarb and Jin [11] also presented a dual simplex variant of 
this algorithm. Recently, Goldfarb, Jin and Orlin [13] designed a new capacity- 
scaling algorithm, motivated by the Fat-Path algorithm. Tseng and Bertsekas 
[23] proposed an e-relaxation method for solving the more general generalized 
minimum cost flow problem with separable convex costs. However, their running 
time may be exponential in the input size. 

Researchers have also developed algorithms for the approximate generalized 
maximum flow problem. Here, the objective is to And a ,^-optimal flow, i.e., a flow 
that generates excess at the sink that is within a (1 - ^) factor of the optimum, 
where ^ is an input parameter. Cohen and Megiddo [2] showed that the approx- 
imate generalized maximum flow problem can be solved in strongly polynomial- 
time. Their algorithm uses a subroutine which tests feasibility of a linear system 
with two variables per inequality. Radzik [20] observed that the Fat-Path algo- 
rithm can be used to compute approximates flow faster than optimal flows. His 
Fat-Path variant, that cancels only flow-generating cycles with large gain, is the 
fastest algorithm for computing approximate flows. Subsequently, Radzik [21] 
gave a new strongly polynomial-time analysis for canceling all flow-generating 
cycles, implying that the original Fat-Path algorithm computes an approximate 
flow in strongly polynomial-time. For the linear programming algorithms, it is 
not known how to improve the worst-case complexity of the exact algorithms to 
And approximate flows. 
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We present a new rounding technique for generalized flows, which can be 
viewed as a type of gain-scaling. Using this technique, we propose three simple 
combinatorial algorithms for the generalized maximum flow problem. Our first 
algorithm is a polynomial-time variant of Truemper’s [22] algorithm. Truem- 
per’s algorithm is a very simple maximum flow based augmenting path algo- 
rithm, analogous to Jewell’s primal-dual algorithm for the minimum cost flow 
problem. Truemper’s algorithm may require exponential-time, but by apply- 
ing our new gain-scaling technique, we develop a polynomial-time variant. Our 
second algorithm is an adaption Goldberg and Tarjan’s [10] preflow-push algo- 
rithm for the minimum cost flow problem. Using gain-scaling, we establish the 
first polynomial-time preflow-push algorithm for generalized flows. Our third 
algorithm is a simple variant of the Fat-Path capacity-scaling algorithm. By 
using gain-scaling, our Fat-Path variant improves the complexity of canceling 
flow-generating cycles, and hence of the overall algorithm. In contrast, Radzik’s 
Fat-Path variant modifies this subroutine, canceling only flow-generating cycles 
with sufficiently large gain. Both Fat-Path variants have the same complexity, 
but Radzik’s variant and proof of correctness are quite complicated. 

2 Preliminaries 

2.1 Generalized Networks 

Since some of our algorithms are iterative and recursive, it is convenient to solve a 
seemingly more general version of the problem which allows for multiple sources. 
An instance of the generalized maximum flow problem is a generalized network 
G = (U, U, t, u, 7 , e), where V is an n-set of nodes, E is an m-set of directed 
arcs, t G V is a distinguished node called the sink, u: E ^ 3?>o is a capacity 
function, E ^ 3?>o is a gain function, and e: V ^ 3?>o is an initial excess 
function. A residual arc is an arc with positive capacity. A lossy network is a 
generalized network in which no residual arc has gain factor exceeding one. We 
consider only simple directed paths and cycles. The gain of a path P is denoted 
by7(^^) = necp 7 (e). The gain of a cycle is defined similarly. A flow- generating 
cycle is a cycle whose gain is more than one. 

For notational convenience we assume that G has no parallel arcs. Our algo- 
rithms easily extend to allow for parallel arcs and the running times we present 
remain valid. Without loss of generality, we assume the network is symmetric and 
the gain function is antisymmetric. That is, for each arc {v, w) G E there is an 
arc {w, v) G E (possibly with zero capacity) and ^{w, v) = l/j^v, w). We assume 
the capacities and initial excesses are given as integers between 1 and B, and 
the gains are given as ratios of integers which are between 1 and B. To simplify 
the running times we assume B > m, and use 0{f) to denote m. 

2.2 Generalized Flows 

A generalized pseudoflow is a function g: E ^ ^ that satisfies the capacity 
constraints g{v, w) < u{v, w) for all (v, w) G E and the antisymmetry constraints 
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g{v, w) = — 7 (w, v)g{w, v) for all {v, w) G E. The residual excess of g at node v is 
= 6 (w) — w)ge ty), i.e., the initial excess minus the the net flow out 
of V. If eg{v) is positive (negative) we say that g has residual excess (deficit) at 
node V. A flow 5 is a pseudoflow that has no residual deficits; it may have residual 
excesses. A proper flow is a flow which does not generate any additional residual 
excess, except possibly at the sink. We note that a flow can be converted into a 
proper flow, by removing flow on useless paths and cycles. For a flow g we denote 
its value Ig] = eg{t) to be the residual excess at the sink. Let OPT(G) denote the 
maximum possible value of any flow in network G. A flow g is optimal in network 
G if Igl = OPT(G) and ^-optimal if \g\ > (1 — ^)OPT(G). The (approximate) 
generalized maximum flow problem is to And a (^-) optimal flow. We sometimes 
omit the adjective generalized when its meaning is clear from context. 



2.3 Residual and Relabeled Networks 

Let g be a generalized flow in network G = {V, E, s, u, 7 , e). With respect to the 
flow g, the residual capacity function is defined by Ug{v,w) = u{v,w) — g{v,w). 
The residual network is Gg = {V,E,s,Ug,^,eg). Solving the problem in the 
residual network is equivalent to solving it in the original network. 

Our algorithms use the technique of relabeling, which was originally intro- 
duced by Glover and Klingman [4]. A labeling function is a function g: V ^ 3?>o 
such that g(t) = 1. The relabeled network is Gg = {V, E, t, Ug, 7 ^, Cg), where the 
relabeled capacity, relabeled gain and relabeled initial excess functions are de- 
flned by: Ug{v,w) = u{v,w)/ g{v), jg{v,w) = j{v,w)g{v)/ g{w), and eg{v) = 
e{v)/g(v). The relabeled network provides an equivalent instance of the gen- 
eralized maximum flow problem. Intuitively, node label g{v) changes the local 
units in which flow is measured at node v, it is the number of old units per 
new unit. The inverses of the node labels correspond to the linear programming 
dual variables, for the primal problem with decision variables g{v,w). With re- 
spect to a flow g and labels g we define the the relabeled residual network by 
Gg^g = (V, E,t,Ug^g,jg, 6g^g), whece the relabeled residual capacity and relabeled 
residual excess functions are defined by Ug^g{v,w) = {u{v,w) — g{v,w)) / g{v) and 
e-g,g{v) = eg{v) / g{v). We define the canonical label of a node v in network G to 
be the inverse of the highest gain residual path from v to the sink. If G has no 
residual flow-generating cycles, then we can compute the canonical labels using 
a single shortest path computation with costs c(v,w) = — logj(v, w). 



2.4 Optimality Conditions 

An augmenting path is a residual path from a node with residual excess to the 
sink. A generalized augmenting path (GAP) is a residual flow-generating cycle, 
together with a (possibly trivial) residual path from a node on this cycle to the 
sink. By sending flow along augmenting paths or GAPs we increase the net flow 
into the sink. The following theorem of Onaga [19] says that the nonexistence of 
augmenting paths and GAPs implies that the flow is optimal. 
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Theorem 1. A flow g is optimal in network G if and only if there are no aug- 
menting paths or GAPs in Gg. 

2.5 Finding a Good Starting Flow 

Our approximation algorithms require a rough estimate of the optimum value in 
a network. Radzik [20] proposed a 0{mf) time greedy augmentation algorithm 
that finds a flow that is within a factor n of the optimum. His greedy algorithm 
repeatedly sends flow along highest-gain augmenting paths, but does not use 
arcs in “backward” direction. Using this algorithm, we can determine an initial 
parameter Z\q which satisfies OPT(G) < Z\o < nOPT(G). 

2.6 Canceling Flow- Generating Cycles 

Subroutine CancelCycles converts a generalized flow g into another general- 
ized flow g' whose residual network contains no flow-generating cycles. In the 
process, the net flow into every node, including the sink, can only be increased. It 
also flnds node labels g. so that G^y^ is a lossy network. This subroutine is used 
by all of our algorithms. CancelCycles was designed by Goldberg, Plotkin, 
and Tardos and is described in detail in [7]. It is an adaptation of Goldberg and 
Tarjan’s [9] cancel-and-tighten algorithm for the minimum cost flow problem 
using costs c{v,w) = — logj 7 (u,w) for any base 6 > 1. Note that negative cost 
cycles correspond to flow-generating cycles. In Section 7 we discuss practical 
implementation issues. 

Theorem 2. Let b > 1. If all of the costs are integral and at least —G, then 
CancelCycles runs in G(mn log G) time. 

In generalized flows, the costs will typically not be integral. In this case, the next 
theorem is useful. 

Theorem 3. If the gains are given as ratios of integers between 1 and B, then 
CancelCycles requires 0{mn^ log B) time. 

Radzik [21] showed that CancelCycles runs in strongly polynomial-time. We 
have a variant that limits the relabeling increases, allowing a simpler proof of 
the strongly polynomial running time. 

2.7 Nearly Optimal Flows 

The following lemma derived from [7] says that if a flow is ^-optimal for suffi- 
ciently small then we can efficiently convert it into an optimal flow. It is used 
to provide termination of our exact algorithms. The conversion procedure in- 
volves one call to CancelCycles and a single (nongeneralized) maximum flow 
computation. 

Lemma 1. Given a B~^™ -optimal flow, we can compute an optimal flow in 
0{mn^ log B) time. 
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3 Gain-Scaling 

In this section we present a rounding and scaling framework. Together, these 
ideas provide a technique which can be viewed as a type of gain-scaling. By 
rounding the gains, we can improve the complexity of many generalized flow 
computations (e.g., canceling flow-generating cycles above). However, our ap- 
proximation from rounding creates error. Using an iterative or recursive ap- 
proach, we can gradually refine our approximation, until we obtain the desired 
level of precision. 



3.1 Rounding Down the Gains 

In our algorithms we round down the gains so that they are all integer powers 
of a base 6 = (1 -I- Our rounding scheme applies in lossy networks, i.e., 

networks in which no residual arc has a gain factor above one. This implies that 
the network has no residual flow-generating cycles. We round the gain of each 
residual arc down to j{v,w) = where c{v,w) = — [logf, 7 (ri, w)J , main- 

taining antisymmetry by setting j{w,v) = l/j{v,w). Note that if both (v,w) 
and (w, v) are residual arcs, then each has unit gain, ensuring that 7 is well- 
defined. Let H denote the resulting rounded network. H is also a lossy network, 
sharing the same capacity function with G. Let ft, be a flow in network H . The 
interpretation of flow ft as a flow in network G is defined by: g{v,w) = h{v,w) 
if g{v,w) > 0 and g{v,w) = —j{w,v)h{w,v) if g{v,w) < 0. Flow interpretation 
in lossy networks may create additional excesses, but no deficits. We show that 
approximate flows in the rounded network induce approximate flows in the orig- 
inal network. First we show that the rounded network is close to the original 
network. 

Theorem 4. Let G be a lossy network and let H be the rounded network con- 
structed as above. // 0 < ^ < 1 then (1 — f)OPT(G) < OPT(H) < OPT(G). 



Proof. Clearly OPT(ift) < OPT(G) since we only decrease the gain factors of 
residual arcs. To prove the other inequality, we consider the path formulation 
of the maximum flow problem in lossy networks. We include a variable Xj for 
each path Pj, representing the amount of flow sent along the path. Let x* be an 
optimal path flow in G. Then x* is also a feasible path flow in H . From path Pj, 
"f{Pj)x* units of flow arrive at the sink in network G, while only j(Pj)x* arrive 
in network H . The theorem then follows, since for each path Pj, 



iiPj) > 



liPj) 

ft'"' 



> 



ijPj) 
i + ? 



> )(!-?)• 



Corollary 1. Let G be a lossy network and let H be the rounded network con- 
structed as above. I/O < ^ < 1 then the interpretation of a f -optimal flow in H 
is a f -\- f -optimal flow in G. 
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Proof. Let ft- be a ^'-optimal flow in H . Let g be the interpretation of flow ft in 
G. Then we have 

\g\ > \h\ > (1 - aOPT{H) > (1 - e)(l - ?')OPT(G) > (1 - ? - ?')OPT(G). 

3.2 Error-Scaling and Recursion 

In this section we describe an error-scaling technique which can be used to speed 
up computations for generalized flow problems. Radzik [20] proposed a recur- 
sive version of error-scaling to improve the complexity of his Fat-Path variant 
when finding nearly optimal and optimal flows. We use the technique in a sim- 
ilar manner to speed up our Fat-Path variant. We also use the idea to convert 
constant-factor approximation algorithms into fully polynomial-time approxima- 
tion schemes. 

Suppose we have a subroutine which finds a 1/2-optimal flow. Using error- 
scaling, we can determine a ^-optimal flow in network G by calling this subroutine 
log 2 (l/'C) times. To accomplish this we first And a 1/2-optimal flow g in network 
G. Then we And a 1/2-optimal flow ft in the residual network Gg. Now g h is 
a 1 /4-optimal flow in network G, since each call to the subroutine captures at 
least half of the remaining flow. In general, we can And a ^-optimal flow with 
log 2 (l/'C) calls to the subroutine. 

The following lemma of Radzik [20] is a recursive version of error-scaling. 
It says that we can compute an ^-optimal flow by combining two appropriate 
V^-optimal flows. 

Lemma 2. Let g he a ^/f-optimal flow in network G. Let h he a ^/^-optimal 
flow in network Gg. Then the flow g h is ^-optimal in G. 

4 Truemper’s Algorithm 

Truemper’s maximum flow based augmenting path algorithm is one of the sim- 
plest algorithms for the generalized maximum flow problem. We apply our gain- 
scaling techniques to Truemper’s algorithm, producing perhaps the cleanest and 
simplest polynomial-time algorithms for the problem. In this section we first 
review Truemper’s [22] algorithm. Our first variant runs Truemper’s algorithm 
in a rounded network. It computes a ^-optimal flow in polynomial-time, for 
any constant ^ > 0. However, it requires exponential-time to compute optimal 
flows, since we would need ^ to be very small. By incorporating error-scaling, we 
show that a simple variant of Truemper’s algorithm computes an optimal flow 
in polynomial-time. 

A natural and intuitive algorithm for the maximum flow problem in lossy net- 
works is to repeatedly send flow from excess nodes to the sink along highest-gain 
(most-efficient) augmenting paths. Onaga observed that if the input network has 
no residual flow-generating cycles, then the algorithm maintains this property. 
Thus, we can And a highest-gain augmenting path using a single shortest path 
computation with costs c(u, w) = — log 7 (u, w). By maintaining canonical labels. 
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we can ensure that all relabeled gains are at most one, and a Dijkstra shortest 
path computation suffices. Unit gain paths in the canonically relabeled network 
correspond to highest gain paths in the original network. This is essentially 
Onaga’s [19] algorithm. If the algorithm terminates, then the resulting flow is 
optimal by Theorem 1. However, this algorithm may not terminate in finite time 
if the capacities are irrational. Truemper’s algorithm [22] uses a (nongeneralized) 
maximum flow computation to simultaneously augment flow along all highest- 
gain augmenting paths. It is the generalized flow analog of Jewell’s primal-dual 
minimum cost flow algorithm. 

Theorem 5. In Truemper’s algorithm, the number of maximum flow computa- 
tions is hounded by n plus the number of different gains of paths in the original 
network. 

Proof. After each maximum flow computation, /i(u) strictly increases for each 
excess node v ^ t. 

4.1 Rounded Truemper (RT) 

Algorithm RT computes a ^-optimal flow by running Truemper’s algorithm in 
a rounded network. The input to Algorithm RT is a lossy network G and an 
error parameter f. Algorithm RT first rounds the gains to integer powers of 
& = (1 -I- described in Section 3.1. Let H denote the rounded network. 

Then RT computes an optimal flow in H using Truemper’s algorithm. Finally the 
algorithm interprets the flow in the original network. Algorithm RT is described 
in Figure 1. 



Input: lossy network G, error parameter 0 < ^ < 1 
Output: ^optimal flow g 

Set base fe = (1 + and round gains in G to powers of b 
Let H be resulting network 
Initialize h ^ 0 

while 3 augmenting path in Hh do 
/i ^ canonical labels in Hh 

f ^ max flow from excess nodes to the sink in Hh.^ using only gain one 
relabeled residual arcs 
h(v, w) ^ h(v, w) -b f{v, w)fj,{v) 

end while 

g ^ interpretation of flow h in G 

Fig. 1: RT(G,?). 



The gain of a path in network G is between and R". Thus, after round- 
ing to powers of b, there are at most 1 3- logj log B) different 

gains of paths in H. Using Goldberg and Tarjan’s [8] preflow-push algorithm, 
each (nongeneralized) maximum flow computation takes 0(mn) time. Thus, by 
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Theorem 5, RT finds an optimal flow in H in 0{mn^^ ^logR) time. The fol- 
lowing theorem follows using Corollary 1. 

Theorem 6. Algorithm RT computes a ^-optimal flow in a lossy network in 
log B) time. 

4.2 Iterative Rounded Truemper (IRT) 

RT does not compute an optimal flow in polynomial-time, since the precision 
required to apply Lemma 1 is roughly ^ = B~^. In Algorithm IRT, we apply 
error-scaling, as described in Section 3.2. IRT iteratively calls RT with error 
parameter 1/2 and the current residual network. Since RT sends flow along 
highest-gain paths in the rounded network, not in the original network, it cre- 
ates residual flow-generating cycles. So, before calling RT in the next iteration, 
we must first cancel all residual flow-generating cycles with subroutine Cancel- 
Cycles, because the input to RT is a lossy network. Intuitively, this can be 
interpreted as rerouting flow from its current paths to highest-gain paths, but 
not all of the rerouted flow reaches the sink. 

Theorem 7. Algorithm IRT computes a ^-optimal flow in 0{mn^ log B log 
time. It computes an optimal flow in O^iufln^log^ B) time. 

In the full paper we prove that Algorithm IRT actually finds an optimal flow in 
0{w?n^ log B + rrfln^ log^ B) time. 

5 Preflow-Push 

In this section we adapt Goldberg and Tarjan’s [10] preflow-push algorithm to 
the generalized maximum flow problem. This is the first polynomial-time preflow 
push algorithm for generalized network flows. Tseng and Bertsekas [23] designed 
a preflow push-like algorithm for the generalized minimum cost flow problem, 
but it may require more than R" iterations. Using our rounding technique, we 
present a preflow-push algorithm that computes a ^-optimal flow in polynomial- 
time for any constant ^ > 0. Then by incorporating error-scaling, we show how 
to find an optimal flow in polynomial-time. 



5.1 Rounded Preflow-Push (RPP) 

Algorithm RPP is a generalized flow analog of Goldberg and Tarjan’s preflow 
push algorithm for the minimum cost flow problem. Conceptually, RPP runs 
the minimum cost flow algorithm with costs c(v,w) = — logj(v,w) and error 
parameter e = ^ log 6 where 6 = (1 -I- This leads to the following natural 

definitions and algorithm. An admissible arc is a residual arc with relabeled gain 
above one. The admissible graph is the graph induced by admissible arcs. An 
active node is a node with positive residual excess and a residual path to the 
sink. We note that if no such residual path exists and an optimal solution sends 
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flow through this node, then that flow does not reach the sink. So we can safely 
disregard this useless residual excess. (Periodically RPP determines which nodes 
have residual paths to the sink.) Algorithm RPP maintains a flow h and node 
labels fx. The algorithm repeatedly selects an active node v. If there is an ad- 
missible arc (u, w) emanating from node v, IPP pushes 5 = mm{eh{v) , Uh{v, w)} 
units of flow from node u to ic. If <5 = Uh{v,w) the push is called saturating; 
otherwise it is nonsaturating. If there is no such admissible arc, RPP increases 
the label of node w by a factor of 2*^ = 6^/"; this corresponding to an additive 
potential increase for minimum cost flows. This process is referred to as a relabel 
operation. Relabeling node v can create new admissible arcs emanating from v. 
To ensure that we do not create residual flow-generating cycles, we only increase 
the label by a relatively small amount. 

The input to Algorithm RPP is a lossy network G and error parameter 
Before applying the preflow-push method, IPP rounds the gains to powers of 
6 = (1 + ^)1/", as described in Section 3.1. The method above is then applied to 
the rounded network H . Algorithm RPP is described in Figure 2. 

We note that our algorithm maintains a pseudoflow with excesses, but no 
deficits. In contrast, the Goldberg-Tarjan algorithm allows both excesses and 
deficits. Also their algorithm scales e. We currently do not see how to improve 
the worst-case complexity by a direct scaling of e. 



Input: lossy network G, error parameter 0 < ^ < 1 
Output: ^-optimal flow g 

Set base fe = (1 + and round gains in network G to powers of b. 

Let H be resulting network 
Initialize h ^ 0, g ^ 1 
while 3 active node v do 

if 3 admissible arc (v, w) then 

Push 5 = mm{eh{v),Uh{v,w)} units of flow from u to w and update h {push} 
else 

g{v) ^ {relabel} 

end if 
end while 

g ^ interpretation of flow h in G 

Fig. 2: RPP(G,a. 



The bottleneck computation is performing nonsaturating pushes, just as for 
computing minimum cost flows with the preflow-push method. By carefully 
choosing the order to examine active nodes (e.g., the wave implementation), 
we can reduce the number of nonsaturating pushes. A dual approach is to use 
more clever data structures to reduce the amortized time per nonsaturating 
push. Using a version of dynamic trees specialized for generalized networks [6], 
we obtain the following theorem. 
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Theorem 8. Algorithm RPP computes a ^-optimal flow in ^logi?) 

time. 

5.2 Iterative Rounded Preflow-Push (IRPP) 

RPP does not compute an optimal flow in polynomial time, since the precision 
required is roughly ^ = B~'^. Like Algorithm IRT, Algorithm IRPP adds error- 
scaling, resulting in the following theorem. 

Theorem 9. IRPP computes a ^-optimal flow in O {mn^ log B log time. It 
computes an optimal flow in O^iufln^log^ B) time. 

6 Rounded Fat Path 

In this section we present a simple variant of Goldberg, Plotkin, and Tardos’ [7] 
Fat-Path algorithm which has the same complexity as Radzik’s [20] Fat-Path 
variant. Our algorithm is intuitive and its proof of correctness is much simpler 
than Radzik’s. The Fat-Path algorithm can be viewed as an analog of Orlin’s 
capacity scaling algorithm for the minimum cost flow problem. The original Fat- 
Path algorithm computes a ^-optimal flow in 0(mn^ log R log time, while 
Radzik’s and our variants require only 0{m{m + n log log R) log^“^) time. 

The bottleneck computation in the original Fat-Path algorithm is cancel- 
ing residual flow-generating cycles. Radzik’s variant reduces the bottleneck by 
canceling only residual flow-generating cycles with big gains. The remaining 
flow-generating cycles are removed by decreasing the gain factors. Analyzing the 
precision of the resulting solution is technically complicated. Instead, our vari- 
ant rounds down the gains to integer powers of a base b, which depends on the 
precision of the solution desired. Our rounding is done in a lossy network, which 
makes the quality of the resulting solution easy to analyze. Subsequent calls to 
CancelCycles are performed in a rounded network, improving the complexity. 

We first review the FatAugmentations subroutine which finds augmenting 
paths with sufflciently large capacity. Then we present Algorithm RFP, which 
runs the Fat-Path algorithm in a rounded network. It computes approximately 
optimal and optimal flows in polynomial-time. We then present a recursive ver- 
sion of RFP, which improves the complexity when computing nearly optimal and 
optimal flows. 

6.1 Fat Augmentations 

The FatAugmentations subroutine was originally developed by Goldberg, 
Plotkin, and Tardos for their Fat-Path algorithm and is described in detail in [7]. 
The input is a lossy network and fatness parameter 5. The subroutine repeatedly 
augments flow along highest-gain 6-fat paths, i.e. highest-gain augmenting paths 
among paths that have enough residual capacity to increase the excess at the sink 
by 6, given sufficient excess at the first node of the path. This process is repeated 




Simple Generalized Maximum Flow Algorithms 321 



until no <5-fat paths remain. There are at most n + OPT(G)/<5 augmentations. 
By maintaining appropriate labels /r, an augmentation takes 0{m) time, using 
an algorithm based on Dijkstra’s shortest path algorithm. Upon termination, 
the final flow has value at least OPT(G) — mS. 

6.2 Rounded Fat Path (RFP) 

Algorithm RFP runs the original Fat-Path algorithm in a rounded network. 
The idea of the original Fat-Path algorithm is to call FatAugmentations and 
augment flow along i5-fat paths, until no such paths remain. At this point 6 is 
decreased by a factor of 2 and a new phase begins. However, since FatAugmen- 
tations selects only paths with large capacity, it does not necessarily send flow 
on overall highest-gain paths. This creates residual flow-generating cycles which 
must be canceled so that we can efficiently compute i5/2-fat paths in the next 
phase. 

The input to Algorithm RFP is a lossy network and an error parameter 
First, RFP rounds down the gains as described in Section 3.1. It maintains a 
flow h in the rounded network H and an upper bound A on the excess dis- 
crepancy, i.e., the difference between the value of the current flow \h\ and the 
optimum OPT(Ff). The scaling parameter A is initialized using Radzik’s greedy 
augmentation algorithm, as described in Section 2.5. In each phase, A is de- 
creased by a factor of 2. To achieve this reduction. Algorithm RFP cancels all 
residual flow-generating cycles in Hh, using the CancelCycles subroutine. By 
Theorem 2 this requires 0{mnlogC) time where G is the biggest cost. Recall 
c{v,w) = — [logf, 7(w, w)J so G < 1-l-logjR = 0{n^~^ log B). Then subroutine 
FatAugmentations is called with fatness parameter S = Aj(2m). After this 
call, the excess discrepancy is at most mS = Aj2, and A is decreased accord- 
ingly. Since each 5-fat augmentation either empties the residual excess of a node 
or increases the flow value by at least 5, there are at most n-\- Aj 5 = n 2m 
augmentations per Z\-phase, which requires a total of 0{m^) time. Algorithm 
RFP is given in Figure 3. 

Theorem 10. Algorithm RFP computes a 2^-optimal flow in a lossy network 
in F mn log(^“^ logR)) log^“^) time. 

Proof. To bound the running time, we note that there are at most log 2 (n/^) 
phases. FatAugmentations requires G(m^) time per phase, and CancelCy- 
GLES requires G(mn log G) time, where we bound G = logB) as above. 

The algorithm terminates when A < ^OPT(il). At this point h is ^-optimal in 
network H, since we maintain A > OPT(il) — \h\. The quality of the resulting 
solution then follows using Theorem 4. 



6.3 Recursive Rounded Fat-Path (RRFP) 

Algorithm RFP computes a ^-optimal flow in 0{wf -\- mn log log R) time when 
^ > 0 is inversely polynomial in m. However it may require more time to 
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Input: lossy network G, error parameter 0 < ^ < 1 
Output: 2^-optimal flow g 

Set base b = {I + and round gains in network G to powers of b 

Let H be resulting network 

Initialize Z\ ^ Z\q and ^ 0 {OPT(JI) < Zio < nOPT{H)} 

repeat 

^ CANCELCYCLES(ff;i) 

h^h + h' 

h' ^ FatAugmentations(LI>i, / r, Zi/(2m)) {OPT(LI^) — \h'\ < A/2} 

h^h + h' 

A «- A/2 

until A < ^OPT(H) 
g ^ interpretation of h in network G 

Fig. 3: RFP(G,e). 



compute optimal flows than the original Fat-Path algorithm. By using the re- 
cursive scheme from Section 3.2, we can compute nearly optimal and optimal 
flows faster than the original Fat-Path algorithm. In each recursive call, we 
reround the network. We cancel flow-generating cycles in an already (partially) 
rounded network. The benefit is roughly to decrease the average value of C from 
0{n^~^ log B) to 0{n\ogB). 

Theorem 11. Algorithm RRFP computes a ^-optimal flow in a lossy network in 
0{m(m + nloglogB)log^~^) time. If the network has residual flow-generating 
cycles, then an extra 0{mn^ log B) preprocessing time is required. Algorithm 
RRFP computes an optimal flow in 0(jn^{m nloglogB) log B) time. 

7 Practical Cycle-Canceling 

We implemented a version of the preflow-push algorithm, described in Section 5, 
in C++ using Mehlhorn and Naher’s [17] Library of Efficient Data types and 
Algorithms (LEDA). We observed that as much as 90% of the time was spent 
canceling flow-generating cycles. We focused our attention on reducing this bot- 
tleneck. 

Recall, CancelCycles is an adaption of Goldberg and Tarjan’s cancel-and- 
tighten algorithm using costs c{v,w) = —logj{v,w). Negative cost cycles cor- 
respond to flow-generating cycles. The underlying idea of the Goldberg-Tarjan 
algorithm is to cancel the most negative mean cost cycle until no negative cost 
cycles remain. To improve efficiency, the actual cancel-and-tighten algorithm 
only approximates this strategy. It maintains a flow g and node potentials (cor- 
responding to node labels) tt that satisfy e- complementary slackness. That is, 
Ctt{v, w) = c{v, w) — 7r(u) -|-7r(r<;) > — e for all residual arcs (u, w). The subroutine 
makes progress by reducing the value of e, using the following two computations 
which comprise a phase: (i) canceling residual cycles in the subgraph induced by 
negative reduced cost arcs and (ii) updating node potentials so that finding new 
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negative cost (flow-generating) cycles is efficient. The cancel-and-tighten algo- 
rithm uses the following two types of potential updates. Loose updating uses a 
computationally inexpensive topological sort, but may only decrease e by a fac- 
tor of (1 — 1/n) per phase. Tight updating reduces e by the maximum possible 
amount, but involves computing the value of the minimum mean cost cycle e*, 
which is relatively expensive. 

We observed that the quality of the potential updates was the most impor- 
tant factor in the overall performance of CancelCycles. So, we focused our 
attention on limiting the number of iterations by better node potential updates. 
Using only loose updates, we observed that CancelCycles required a large 
number of phases. By using tight updates, we observed a significant reduction in 
the number of phases, but each phase is quite expensive. The goal is to a reach 
a middle ground. We introduce a medium updating technique which is much 
cheaper than tight updating, yet more effective than loose updating; it reduces 
the overall running time of the cycle canceling computation. Our implementation 
uses a combination of loose, medium, and tight potential updates. 

Tight updating requires 0(mn) time in the worst case, using either a dynamic 
programming or binary search method. We incorporated several heuristics to 
improve the actual performance. These heuristics are described in the full paper. 
However, we observed that tight relabeling was still quite expensive. 

We introduce a medium potential updating which is a middle ground between 
loose and tight updating. In medium updating, we find a value d which is close 
to e*, without spending the time to find the actual minimum mean cost cycle. 
In our algorithm, we only need to estimate e* in networks where the subgraph 
induced by negative cost arcs is acyclic. To do this efficiently, we imagine that 
the in addition to the original arcs, a zero cost link exists between every pair 
of nodes. We can efficiently find a minimum mean cost cycle in this modified 
network by computing a minimum mean cost path in the acyclic network induced 
by only negative cost arcs, without explicitly considering the imaginary zero cost 
arcs. Let d denote the value of the minimum mean cost cycle in the modified 
network. Clearly d < e*, and it is not hard to see that e' > (1 — l/n)e. We 
can binary search for d using a shortest path computation in acyclic graphs. 
This requires only 0(m) time per iteration. If we were to determine d exactly, 
in O(nlogB) iterations the search interval would be sufficiently small. If the 
gains in the network are rounded to powers of 6 = (1-1- then O(logC) 

iterations suffice, where C = 0(n^~^ log B). In our implementation we use an 
approximation to e'. 
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Abstract. We introduce an algorithm that solves the maximum flow 
problem without generating flows explicitly. The algorithm solves di- 
rectly a problem we call the maximum s-excess problem. That problem 
is equivalent to the minimum cut problem, and is a direct extension of 
the maximum closure problem. The concepts used also lead to a new 
parametric analysis algorithm generating all breakpoints in the amount 
of time of a single run. 

The insights derived from the analysis of the new algorithm lead to a new 
simplex algorithm for the maximum flow problem - a pseudoflow-based 
simplex. We show that this simplex algorithm can perform a parametric 
analysis in the same amount of time as a single run. This is the hrst 
known simplex algorithm for maximum flow that generates all possible 
breakpoints of parameter values in the same complexity as required to 
solve a single maximum flow instance and the fastest one. 

The complexities of our pseudoflow algorithm, the new simplex algo- 
rithm, and the parametric analysis for both algorithms are 0(mn log n) 
on a graph with n nodes and m arcs. 



1 Introduction 

This extended abstract describes an efficient new approach to the maximum 
flow and minimum cut problems. The approach is based on a new certificate 
of optimality inspired by the algorithm of Lerchs and Grossmann, [LG64] . This 
certificate, called normalized tree, partitions the set of nodes into subsets some of 
which have excess capacity and some have capacity deficit. The nodes that belong 
to the subsets with excess form the source set of a candidate minimum cut. The 
algorithm solves, instead of the maximum flow problem, another problem which 
we call the maximum s- excess problem. That problem is defined on a directed 
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graph with arc capacities and node weights and does not contain distinguished 
source and sink nodes. The objective of the s-excess problem is to find a subset 
of the nodes that maximizes the sum of node weights, minus the weight of 
the arcs separating the set from the remainder of the nodes. The new problem 
is shown to be equivalent to the minimum cut problem that is traditionally 
solved by deriving a maximum flow first. With the new algorithm these problems 
can be solved without considering flows explicitly. The steps of the algorithm 
can be interpreted as manipulating pseudoflow - a flow that does not satisfy 
flow balance constraints. For this reason we choose to call the algorithm the 
pseudoflow algorithm. 

The main feature that distinguishes the pseudoflow algorithm from other 
known algorithms for the maximum flow problem is that it does not seek to either 
preserve or progress towards feasibility. Instead the algorithm creates “pockets” 
of nodes so that at optimum there is no residual arc that can carry additional 
flow between an “excess pocket” and a “deficit pocket”. The set of nodes in 
all the “excess pockets” form the source set of a minimum cut and also the 
maximum s-excess set. 

The certificate maintained by our algorithm bears a resemblance to the ba- 
sic arcs tree maintained by simplex. It is demonstrated that this certificate is 
analogous to the concept of a strong basis introduced by Cunningham [C76] 
if implemented in a certain “extended network” permitting violations of flow 
balance constraints. It is further shown that the algorithmic steps taken by our 
algorithm are substantially different from those of the simplex and lead to a 
different outcome in the next iteration based on the same strong basis in the 
given iteration. 

The contributions in this paper include: 

1 . The introduction of the pseudoflow maximum s-excess problem that provides 
a new perspective on the maximum flow problem. 

2. A pseudoflow algorithm for the maximum flow problem of complexity 
0{mn log n). 

3. Parametric analysis conducted with the pseudoflow algorithm that generates 
all breakpoints in the same complexity as a single run. 

4. A new pseudoflow-based simplex algorithm for maximum flow using the low- 
est label approach. 

5. A parametric analysis simplex method that finds all possible parameter 
breakpoints in the same time as a single run, 0(mn log n). 

The parametric simplex method is the first known parametric implementa- 
tion of simplex to date that finds all breakpoints in the same running time as a 
single application. 



1.1 Notation 

For P,Q C V, the set of arcs going from P to Q is denoted by, (P, Q) = {(m, v) G 
A\u G P and v G Q}. Let the capacity of arc (u, v) be denoted by Cuv or c(u, v). 
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For P,Q C V, P n Q = 0, the capacity of the cut separating P from Q is, 
C{P, Q) = E(„.„)6(p.q) c™. For SCV,letS = V\S. 

For a graph G = {V,A) we denote the number of arcs by m = \A\ and the 
number of nodes by n = |y|. 

An arc (u,u) of an unspecified direction is referred to as edge [m, u]. 

[vi,V 2 , ■ ■ ■ , Vk] denotes an undirected path from v\ to Vk- That is, 

[VI,V2], . . . , [vk-l,Vk\ G A. 

We use the convention that the capacity of an arc that is not in the graph is 
zero. Thus for (a,b) G A and (b,a) ^ A, ct^a = 0. 

The capacity of an edge e is denoted either by c(e) or Cg. The flow on an edge 
e is denoted either by /(e) or /g. We use the convention that /(a, b) = —f{b, a). 
For a given flow or pseudoflow /, the residual capacity of e is denoted by c/(e) 
which is Cg — /g. 

Given a rooted tree, T, T„ is the subtree suspended from node v that contains 
all the descendants of v in T. T\^v,p{v)] = Tv is the subtree suspended from the 
edge [u,p(u)]. An immediate descendant of a node u, a child of u, is denoted by 
ch{v), and the unique immediate ancestor of a node v, the parent of v, by p{v). 

2 The Maximum Flow Problem and the Maximum 
s-Excess Problem 

The pseudoflow algorithm described here ultimately finds a maximum flow in 
a graph. Rather than solving the problem directly the algorithm solves instead 
the maximum s-excess problem. 

Problem Name: Maximum s- Excess 

Instance: Given a directed graph G = (V, A), node weights (positive or neg- 
ative) Wi for all i GV, and nonnegative arc weights Cij for all (i,j) G A. 
Optimization Problem: Find a subset of nodes S CV such that 
ieSjeS^ij 'is maximum. 

We elaborate here further on this problem and its relationship to the maxi- 
mum flow and minimum cut problems. 

The maximum flow problem is defined on a directed graph with distinguished 
source and sink nodes and the arcs adjacent to source and sink, A(s) and A{t), 
Gst = {VU {s, t}, A U A(s) U A(t)). 

The standard formulation of the maximum flow problem with zero lower 
bounds and Xij variables indicating the amount of flow on arc (i,j) is. 

Max xts 

subject to - J2j Xjk = 0 k gV 

0 'G Xij "G dj V(z,j) G A. 

In this formulation the first set of (equality) constraints is called the flow bal- 
ance constraints. The second set of (inequality) constraints is called the capacity 
constraints. 
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Definition 1. The s- excess capacity of a set S C V in the graph Gst = U 
{s, t}, A U A(s) U A(t)) is, C({s}, -S') - C{S, S U {t}). 

We claim that finding a subset S ffV that maximizes C({s}, S) — C{S, S U 
{t}) is equivalent to the maximum s-excess problem. 

Lemma 2. A subset of nodes S C V maximizes C'({s},5') — C{S,Syj {t}) in 
Gst if and only if it is of maximum s-excess in the graph G = {V, A). 

We next prove that the s-excess problem is equivalent to the minimum cut 
problem, 

Lemma 3. S is the source set of a minimum cut if and only if it is a set of 
maximum s- excess capacity C{s, S) — G{S, S U {t}) in the graph. 

Proof. Given an instance of the s-excess problem. Append to the graph G = 
{V,A) the nodes s and t; Assign arcs with capacities equal to the weights of 
nodes from s to the nodes of positive weight; Assign arcs with capacities equal 
to the absolute value of the weights of nodes of negative weights, from the nodes 
to t. 

The sum of weights of nodes in S is also the sum of capacities Cds}, S) — 
C(S', {t}) where the first term corresponds to positive weights in S, and the 
second to negative weights in S: 

= C'({4, S) - C{S, {t}) - G(S, S) 

= G{{s},S)-G{S,SU{t}). 

= G({4, y) - C({s}, S) - G{S, S U {t}). 

= G{{s},V)-G{{.s}US,SU{t}). 

The latter term is the capacity of the cut ({s}US', ;SU{t}). Hence maximizing the 
s-excess capacity is equivalent to minimizing the capacity of the cut separating 
s from t. 

To see that the opposite is true, consider the network (V U {s,t},A) with 
arc capacities. Assign to node v G V a weight that is Csv if the node is adjacent 
to s and — c„t if the node is adjacent to t. Note that it is always possible to 
remove paths of length 2 from s to t thus avoiding the presence of nodes that 
are adjacent to both source and sink. This is done by subtracting from the arcs’ 
capacities Csv,Cvt the quantity min{ Cg„, c„t}. The capacities then translate into 
node weights that serve as input to the s-excess problem and satisfy the equalities 
above. □ 

We conclude that the maximum s-excess problem is a complement of min- 
imum cut which in turn is a dual of the maximum flow problem. As such, its 
solution does not contain more flow information than the solution to the min- 
imum cut problem. As we see later, however, it is possible to derive a feasible 
flow of value equal to that of the cut from the certificate used in the algorithm, 
in 0{mn) time. 
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The reader may wonder about the arbitrary nature of the s-excess problem, 
at least in the sense that it has not been previously addressed in the literature. 
The explanation is that this problem is a relaxation of the maximum closure 
problem where the objective is to find, in a node weighted graph, a closed set of 
nodes of maximum total weight. In the maximum closure problem it is required 
that all successors of each node in the closure set will belong to the set. In the 
s-excess problem this requirement is replaced by a penalty assigned to arcs of 
immediate successors that are not included in the set. In that sense the s-excess 
problem is a relaxation of the maximum closure problem. The proof of Lemma 
3 is effectively an extension of Picard’s proof [Pic76] demonstrating that the 
maximum weight closed set in a graph (maximum closure) is the source set of a 
minimum cut. 

We provide a detailed account of the use of the pseudoflow and other algo- 
rithms for the maximum closure problem in [Hoc96] . We also explain there how 
these algorithms have been used in the mining industry and describe the link to 
the algorithm of Lerchs and Grossmann, [LG64] . 



3 Preliminaries and Definitions 

Pseudoflow is an assignment of values to arcs that satisfy the capacity constraints 
but not necessarily the flow balance constraints. Unlike preflow, that may violate 
the flow balance constraints only with inflow exceeding outflow, pseudoflow per- 
mits the inflow to be either strictly larger than outflow (excess) or outflow strictly 
larger than inflow (deficit). Let / be a pseudoflow vector with 0 < fij < Cij the 
pseudoflow value assigned to arc (f,j). Let in flow {D), out flow {D) be the total 
amount of flow incoming and outgoing to and from the set of nodes D. For each 
subset of nodes D CV, 

excess{D) = inflow (D) — outflow (D) 

— S(m,j;)g(FU{s}\D,D) ~ S(«,«)G(D,yu{t}\D) /«.“■ 

The absolute value of excess that is less than zero is called deficit, i.e. 
— excess{D) = deficit{D). 

Given a rooted tree T. For a subtree T„ = T[„_p(„)], let M[„ = 

excessfiPy) and be called the mass of the node v or the arc [v,p{v)]. That is, 
the mass is the amount of flow on (v,p{v)) directed towards the root. A flow 
directed in the opposite direction ~ from p{v) to u - is interpreted as negative 
excess or mass. 

We define the extended network as follows: The network Gst is augmented 
with a set of arcs - two additional arcs per node. Each node has one arc of 
infinite capacity directed into it from the sink, and one arc of infinite capacity 
directed from it to the source. This construction is shown in Figure 1. We refer 
to the appended arcs from sink t as the deficit arcs and the appended arcs to the 
source s as the excess arcs. The source and sink nodes are compressed into a ‘root’ 
node r. We refer to the extended network’s set of arcs as These include, in 
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excess arc deficit arc 




Fig. 1. An extended network with excesses and deficits. 



addition to the deficit and excess arcs, also the arcs adjacent to source and sink 
- A(s) and A{t). The extended network is the graph = (F u {r}, 

Any pseudofiow on a graph has an equivalent feasible flow on the extended 
network derived from the graph - a node with excess sends the excess back to 
the source, and a node with deficit receives a flow that balances this deficit from 
the sink. 

Throughout our discussion of flows on extended networks, all the flows con- 
sidered saturate the arcs adjacent to source and sink and thus the status of 
these arcs, as saturated, remains invariant. We thus omit repeated reference to 
the arcs A(s) and A{t). 

4 A Normalized Tree 

The algorithm maintains a construction that we call a normalized tree after the 
use of this term by Lerchs and Grossmann in [LG64] for a construction that 
inspired ours. Let node r serve as root and represent a contraction of s and 
t. Let {y U {r}, T) be a tree where T C A. The children of r are called the roots 
of their respective branches or subtrees. The deficit and excess arcs are only used 
to connect r to the roots of the branches. 

A normalized tree is a rooted tree in r that induces a forest in (V,A). We 
refer to each rooted tree in the forest as branch. A branch of the normalized 
tree rooted at a child of r, r^, Tr.^ is called strong if excess{TrJ > 0, and weak 
otherwise. All nodes of strong branches are considered strong, and all nodes of 
weak branches are considered weak. 

A normalized tree is depicted in Figure 2. 




A New Max Flow Algorithm 331 




Fig. 2. A normalized tree. Each is a root of a branch. 



Consider a rooted forest T \ {r} in G = (F, A) and a pseudoflow / in Gst 
satisfying the properties: 

Property 4. The pseudoflow f saturates all source-adjacent arcs and all sink- 
adjacent arcs. 

Property 5. In every branch all downwards residual capacities are strictly pos- 
itive. 

Property 6. The only nodes that do not satisfy flow balance constraints are the 
roots of their respective branches that are adjacent to r in the extended network. 



Definition 7. A tree T with pseudoflow f is called normalized if it satisfies 
properties j, 5, and 6. 

Property 6 means, in other words, that in order for T to be a normalized 
tree, / has to satisfy flow balance constraints in the extended network with only 
the roots of the branches permitted to send/receive flows along excess or deflcit 
arcs. 

We let the excess of a normalized tree T and pseudoflow / be the sum of 
excesses of its strong branches (or strong nodes). Property 4 implies that the 
excess of a set of strong nodes in a normalized tree, S, satisfies: 

excess{S) = inflow{S) — outflow{S) = /({s}, S) + f{S, S) — f{S, S) 
-f(S, {t}) = G({s}, S) - C(S, {t}) + f{S, S) - f(S, S). 

This equality is used to prove the superoptimality of the set of strong nodes (in 
the superoptimality Property). 

We choose to work with normalized trees that satisfy an optional property 
stronger than 5: 
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Property 8 (Unsaturated arcs property). The tree T has all upwards resid- 
ual capacities strictly positive. 

Another optional implied property is: 

Property 9. All “free” arcs of A with flows strictly between lower and upper 
hound are included in T . 

With this property all arcs that are not adjacent to root are free. T is thus the 
union of all free arcs including some excess and deficit arcs. The only arcs in T 
that are not free are deficit arcs with 0 flow - adjacent to 0-deflcit branches. All 
out of tree arcs are thus at their upper or at the lower bounds. 

The next property - the superoptimality property - is satisfied by any nor- 
malized tree, or equivalently, by any tree-pseudoflow pair satisfying properties 
4, 5, and 6. The superoptimality of a normalized tree and the conditions for 
optimality are stated in the next subsection. 

4.1 The Superoptimality of a Normalized Tree 

Property 10 (Superoptimality). The set of strong nodes of the normalized 
tree T is a superoptimal solution to the s-excess problem: The sum of excesses 
of the strong branches is an upper hound on the maximum s-excess. 

From the proof of the superoptimality property it follows that when all arcs 
from S to S are saturated, then no set of nodes other than S has a larger s-excess. 
We thus obtain the optimality condition as a corollary to the superoptimality 
property. 

Corollary 11 (Optimality couditiou). Given a normalized tree, a pseud- 
oflow f and the collection of strong nodes in the tree S. If / saturates all arcs 
in {S, S) then S is a maximum s-excess set in the graph. 

The next Corollary holds only if Property 8 is satisfied. It implies minimality 
of the optimal solution set S. 

Corollary 12 (Miuimality). Any proper subset of the strong nodes is not a 
maximum s-excess set in (V,A'). 

On the other hand, it is possible to append to the set of strong nodes S any 
collection of 0-deflcit branches that have no residual arcs to weak nodes without 
changing the value of the optimal solution. This leads to a maximal maximum 
s-excess set. 

5 The Description of the Pseudoflow Algorithm 

The algorithm maintains a superoptimal s-excess solution set in the form of 
the set of strong nodes of a normalized tree. That is, the sum of excesses of 
strong branches is only greater than the maximum s-excess. Each strong branch 
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forms an “excess pocket” with the total excess of the branch assigned to its root. 
Within each branch the pseudoflow is feasible. 

Each iteration of the algorithm consists of identifying an infeasibility in the 
form of a residual arc from a strong node to a weak node. The arc is then added 
in and the tree is updated. The update consists of pushing the entire excess 
of the strong branch along a path from the root of the strong branch to the 
merger arc (s', w) and progressing towards the root of the weak branch. The first 
arc encountered that does not have sufficient residual capacity to accommodate 
the pushed flow gets split and the subtree suspended from that arc becomes a 
strong branch with excess equal to the amount of flow that could not be pushed 
through the arc. The process continues along the path till the next bottleneck 
arc is encountered 

Recall that G = (V,A), and Af is the set of residual arcs with respect to a 
given pseudoflow /. 

5.1 The Pseudoflow Algorithm 



begin 

Initialize V(s,j) G A(s), /(s,j) = c(s,j). V(j,t) G A{t), f{j,t) = c(j,t). 
For all arcs (i,j) G A, f{i,j) = 0. 

T = Ujgy[r, j], the branches of the tree are {j}j^v- 
Nodes with positive excess are strong, S, and the rest are weak, W. 
while (S', W) n A/ yf 0 do 
Select (s',w) G (S, IT) 

Merge T ^ T\[r, r^/] U (s', w). 

Renormalize 

Push S = units of flow along the path , s' ,w, . . . , r^, r]: 

begin 

Let [vi,Vi+i] be the next edge on the path. 

If c/(ui,Uj+i) > 5 augment flow by 5, /(ui,Uj+i) ^ /(ui,Uj+i) + 5. 
Else, split {(ui,z;j+i),(5 - Cf{vi,v^+l)}. 

Set 6 ^ Cf{v^,Vi+i). 

Set /(uj, Vi+i) ^ c{v^, Vi+i); i ^i+l 

end 

end 

end 

procedure split {{a,b), M} 

T ^ T\{a,b)U (r, a); = M. 

The branch Ta is strong or 0-deflcit with excess M . 

Af ^ A/U{(6,a)}\{(a,6)}. 



The push step, in which we augment flow by S if Cf{vi,Vi+i) > S, can be 
replaced by augmentation if c/(ui,Ui+i) > 6. The algorithm remains correct, 
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but the set of strong nodes is no longer minimal among maximum s-excess sets, 
and will not satisfy Property 8. We prove in the expanded version of this paper 
that the tree maintained is indeed normalized, which establishes the algorithm’s 
correctness. 



5.2 Initialization 

We choose an initial normalized tree with each node as a separate branch for 
which the node serves as root. The corresponding pseudoflow saturates all arcs 
adjacent to source and to sink. Thus all nodes adjacent to source are strong 
nodes, and all those adjacent to sink are weak nodes. All the remaining nodes 
have zero inflow and outflow, and are thus of 0 deficit and set as weak. If a node 
is adjacent to both source and sink, then the lower capacity arc among the two 
is removed, and the other has that value subtracted from it. Therefore each node 
is uniquely identified with being adjacent to either source or sink or to neither. 



5.3 Termination 

The algorithm terminates when there is no residual arc between any strong and 
weak nodes. 

From Corollary 12 we conclude that at termination the set of strong nodes 
is a minimal source set of a minimum cut. In other words, any proper subset 
of the set of strong nodes cannot be a source set of a minimum cut. It will be 
necessary to identify additional optimal solutions, and in particular a minimum 
cut with maximal source set for the parametric analysis. 

To that end, we identify the 0-deflcit branches among the weak branches. The 
set of strong branches can be appended with any collection of 0-deflcit branches 
without residual arcs to weak nodes for an alternative optimal solution. The 
collection of all such 0-deflcit branches with the strong nodes forms the sink set 
of a minimum cut that is maximal. To see that, consider an analogue of Corollary 
12 that demonstrates that no proper subset of a weak branch (of negative deficit) 
can be in a source set of a minimum cut. 



5.4 About Normalized Trees and Feasible Flows 

Given any tree in the extended network, and a specification of pseudoflow values 
on the out of tree arcs, it is possible to determine in linear time 0(n) whether 
the tree is normalized. If the tree is not normalized, then the process is used to 
derive a normalized tree which consists of a subset of the arcs of the given tree. 
Given a normalized tree it is possible to derive in 0{n) time the values of the 
pseudoflow on the tree arcs, and in time 0{mn) to derive an associated feasible 
flow. At termination, that feasible flow is a maximum flow. 
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5.5 Variants of Pseudoflow Algorithm and Their Complexity 

Implementing the pseudoflow algorithm in its generic form results in complexity 
of 0(nM+) iterations, for M+ = C({s},V) - the sum of all capacities of arcs 
adjacent to source. This running time is not polynomial. 

A natural variant to apply is capacity scaling. Capacity scaling is imple- 
mented in a straightforward way with running time of 0{mnlog M), where M is 
the largest capacity of source-adjacent or sink-adjacent arcs. This running time 
is polynomial but not strongly polynomial. 

Our strongly polynomial variant relies on the lowest label selection rule of a 
merger arc. With this selection rule, our algorithm runs in strongly polynomial 
time, 0{mn\ogn). 

The lowest label selection rule is described recursively. Initially all nodes are 
assigned the label 1, = 1 for all v € V. The arc (s', w) selected is such that w 

is a lowest label weak node among all possible active arcs. 

Upon a merger using the arc (s',w) the label of the strong node s' becomes 
the label of w plus 1 and all nodes of the strong branch with labels smaller than 
that of s' are updated to be equal to the label of s': formally, is' <— -I- 1 and 

for all nodes v in the same branch with s', iy <— ma,x{iy,£s'}. 

The lowest label rule guarantees a bound of mn on the total number of 
iterations. In the phase implementation all weak nodes of the same label are 
processed in one phase. The phase implementation runs in time 0(mnlogn). 

The lowest label implementation of the algorithm is particularly suitable for 
parametric implementation. The algorithm has features that make it especially 
easy to adjust to changes in capacities. The common type of analysis finding all 
breakpoints in parametric capacities of arcs adjacent to source and sink that are 
linear functions of the parameter A can also be implemented in O (mn log n). 



6 Pseudoflow-Based Simplex 

The simplex algorithm adapted to the s-excess problem maintains a pseudoflow 
with all source and sink adjacent arcs saturated. At termination, the optimal 
solution delivered by this s-excess version of simplex identifies a minimum cut. 
The solution is optimal when only source adjacent nodes have positive excess 
and only sink adjacent nodes have deficits. Additional running time is required 
to reconstruct the feasible flows on the given tree, that constitute maximum 
flow. 

We show in the full version of the paper that the concept of a strong basis, 
introduced by Cunningham [C76], is a tree in the extended network satisfying 
Properties 5, 6 and 9. 



6.1 Pseudoflow-Based Simplex Iteration 

An entering arc is a merger arc. It completes a cycle in the residual graph. It 
is thus an arc between two branches. We include an auxiliary arc from sink to 
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source, thus the merger arc completes a cycle. Alternatively we shrink source 
and sink into a single node r as before. 

Nodes that are on the source side of the tree are referred to as strong, and 
those that are on the sink side, as weak, with the notation of S and W respec- 
tively. Let an entering arc with positive residual capacity be (s',w). The cycle 
created is [r, rs> , ■ ■ ■ , s' ,w, . . . , r^j, r]. 

The largest amount of the flow that can be augmented along the cycle is 
the bottleneck residual capacity along the cycle. The first arc attaining this 
bottleneck capacity is the leaving arc. 

In the simplex the amount of flow pushed is determined by the bottleneck 
capacity. In the pseudoflow algorithm the entire excess is pushed even though it 
may be blocked by one or more arcs that have insufficient residual capacity. 

The use of the lowest label selection rule in the pseudoflow-based simplex 
algorithm for the choice of an entering arc leads to precisely the same complexity 
as that of our pseudoflow algorithm, 0(mn log n). 

6.2 Parametric Analysis for Pseudoflow-Based Simplex 

Given a series of ^ parameter values for A, {Ai,...,A^}. Let the source adja- 
cent arcs capacities and the sink adjacent arc capacities be a linear function 
of A with the source adjacent capacities monotone nondecreasing with A and 
the sink adjacent capacities monotone nonincreasing with A. Recently Goldfarb 
and Ghen [GG96] presented a dual simplex method with running time 0{mn^). 
This method is adaptable to use for sensitivity analysis for such a sequence of 
£ parameter values, in the same amount of time as a single run, 0{mn^ + n£). 
The algorithm, however, does not generate all parameter breakpoints in a com- 
plete parametric analysis. Our algorithm is the first simplex algorithm that does 
generate all parameter breakpoints. 

The parametric analysis process is not described here. We only mention that 
in order to implement the complete parametric analysis we must recover from 
the simplex solution the minimal and maximal source sets minimum cuts. To do 
that, we scan the tree at the end of each computation for one parameter value, 
and separate 0-deflcit branches. This process is equivalent to the normalization 
of a tree. It adds only linear time to the running time and may be viewed as basis 
adjustment. The running time is linear in the number of nodes in the currently 
computed graph. 

The overall running time of the procedure is identical to that of the pseud- 
oflow algorithm with lowest label, 0(mn log n -I- n£). 

6.3 Comparing Simplex to Pseudoflow 

Although the simplex implementation in the extended network has the same 
complexity as that of the pseudoflow algorithm, the two algorithms are not the 
same. Starting from the same normalized tree a simplex iteration will produce 
different trees in the following iteration. The pseudoflow algorithm tends to pro- 
duce trees that are shallower than those produced by simplex thus reducing the 
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average work per iteration (which depends on the length of the path from the 
root to the merger node). Other details on the similarities and differences be- 
tween simplex and the pseudoflow algorithm are provided in the full version of 
the paper. 
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Abstract. The minimum- cost multicommodity flow problem involves si- 
multaneously shipping multiple commodities through a single network so 
that the total flow obeys arc capacity constraints and has minimum cost. 
Multicommodity flow problems can be expressed as linear programs, 
and most theoretical and practical algorithms use linear-programming 
algorithms specialized for the problems’ structures. Combinatorial ap- 
proximation algorithms in [GK96,KP95b,PST95] yield flows with costs 
slightly larger than the minimum cost and use capacities slightly larger 
than the given capacities. Theoretically, the running times of these algo- 
rithms are much less than that of linear-programming-based algorithms. 
We combine and modify the theoretical ideas in these approximation al- 
gorithms to yield a fast, practical implementation solving the minimum- 
cost multicommodity flow problem. Experimentally, the algorithm solved 
our problem instances (to 1% accuracy) two to three orders of magni- 
tude faster than the linear-programming package CPLEX [CPL95] and the 
linear-programming based multicommodity flow program PPRN [CN96]. 
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1 Introduction 

The minimum-cost multicommodity flow problem involves simultaneously ship- 
ping multiple commodities through a single network so the total flow obeys the 
arc capacity constraints and has minimum cost. The problem occurs in many 
contexts where different items share the same resource, e.g., communication net- 
works, transportation, and scheduling problems [AM093,HL96,H096]. 

Traditional methods for solving minimum-cost and no-cost multicommodity 
flow problems are linear-programming based [AMO93,Ass78,CN96,KH80]. Using 
the ellipsoid [Kha80] or the interior-point [Kar84] methods, linear-programming 
problems can be solved in polynomial time. Theoretically, the fastest algorithms 
for solving the minimum-cost multicommodity flow problem exactly use the 
problem structure to speed up the interior-point method [KP95a,KV86,Vai89]. 

In practice, solutions to within, say 1%, often suffice. More precisely, we say 
that a flow is e-optimal if it overflows the capacities by at most 1 -I- e factor and 
has cost that is within 1 -|- e of the optimum. Algorithms for computing approxi- 
mate solutions to the multicommodity flow problem were developed in [LMP+flS] 
(no-cost case) and [GK96,KP95b,PST95] (minimum-cost case). Theoretically, 
these algorithms are much faster than interior-point method based algorithms 
for constant e. The algorithm in [LMP+95] was implemented [LSS93] and was 
shown that indeed it often outperforms the more traditional approaches. Prior to 
our work, it was not known whether the combinatorial approximation algorithms 
for the minimum-cost case could be implemented to run quickly. 

In this paper we describe MCMCF, our implementation of a combinatorial ap- 
proximation algorithm for the minimum-cost multicommodity flow problem. A 
direct implementation of [KP95b] yielded a correct but practically slow imple- 
mentation. Much experimentation helped us select among the different theoret- 
ical insights of [KP95b,LMP+95,LSS93,PST95,Rad97] to achieve good practical 
performance. 

We compare our implementation with CPLEX [CPL95] and PPRN [CN96]. (Sev- 
eral other efficient minimum-cost multicommodity flow implementations, e.g., 
[ARVK89], are proprietary so we were unable to use these programs in our study.) 
Both are based on the simplex method [Dan63] and both And exact solutions. 
CPLEX is a state-of-the-art commercial linear programming package, and PPRN 
uses a primal partitioning technique to take advantage of the multicommodity 
flow problem structure. 

Our results indicate that the theoretical advantages of approximation algo- 
rithms over linear-programming-based algorithms can be translated into prac- 
tice. On the examples we studied, MCMCF was several orders of magnitude faster 
than CPLEX and PPRN. For example, for 1% accuracy, it was up to three orders 
of magnitude faster. Our implementation’s dependence on the number of com- 
modities and the network size is also smaller, and hence we are able to solve 
larger problems. 

We would like to compare MCMCF’s running times with modified CPLEX and 
PPRN programs that yield approximate solutions, but it is not clear how to make 
the modifications. Even if we could make the modifications, we would probably 
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need to use CPLEX’s primal simplex to obtain a feasible flow before an exact 
solution is found. Since its primal simplex is an order of magnitude slower than 
its dual simplex for the problem instances we tested, the approximate code would 
probably not be any faster than computing an exact solution using dual simplex. 

To And an e-optimal multicommodity flow, MCMCF repeatedly chooses a com- 
modity and then computes a single-commodity minimum-cost flow in an auxil- 
iary graph. This graph’s arc costs are exponential functions of the current flow. 
The base of the exponent depends on a parameter a, which our implementation 
chooses. A fraction a of the commodity’s flow is then rerouted to the correspond- 
ing minimum-cost flow. Each rerouting decreases a certain potential function. 
The algorithm iterates this process until it finds an e-optimal flow. 

As we have mentioned above, a direct implementation of [KP95b], while 
theoretically fast, is very slow in practice. Several issues are crucial for achieving 
an efficient implementation: 

Exponential Costs: The value of the parameter a, which defines the base of 
the exponent, must be chosen carefully: Using a value that is too small will 
not guarantee any progress, and using a value that is too large will lead 
to very slow progress. Our adaptive scheme for choosing a leads to signifi- 
cantly better performance than using the theoretical value. Importantly, this 
heuristic does not invalidate the worst-case performance guarantees proved 
for algorithms using fixed a. 

Stopping Condition: Theoretically, the algorithm yields an e-optimal flow 
when the potential function becomes sufficiently small [KP95b]. Alterna- 
tive algorithms, e.g., [PST95], explicitly compute lower bounds. Although 
these stopping conditions lead to the same asymptotic running time, the 
latter one leads to much better performance in our experiments. 

Step Size: Theory specifies the rerouting fraction ct as a fixed function of a. 
Computing cr that maximizes the exponential potential function reduction 
experimentally decreases the running time. We show that is it possible to use 
the Newton-Raphson method [Rap90] to quickly And a near-optimal value 
of (T for every rerouting. Additionally, a commodity’s flow usually differs 
from its minimum-cost flow on only a few arcs. We use this fact to speed up 
these computations. 

Minimum- Cost Flow Subroutine: Minimum-cost flow computations domi- 
nate the algorithm’s running time both in theory and in practice. The arc 
costs and capacities do not change much between consecutive minimum-cost 
flow computations for a particular commodity. Furthermore, the problem 
size is moderate by minimum-cost flow standards. This led us to decide to 
use the primal network simplex method. We use the current flow and a ba- 
sis from a previous minimum-cost flow to “warm-start” each minimum-cost 
flow computation. Excepting the warm-start idea, our primal simplex code 
is similar to that of Grigoriadis [Gri86]. 

In the rest of this paper, we first introduce the theoretical ideas behind the 
implementation. After discussing the various choices in translating the theoreti- 
cal ideas into practical performance, we present experimental data showing that 
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MCMCF’s running time’s dependence on the accuracy e is smaller than theoreti- 
cally predicted and its dependence on the number k of commodities is close to 
what is predicted. We conclude by showing that the combinatorial-based imple- 
mentation solves our instances two to three orders of magnitude faster than two 
simplex-based implementations. In the longer version of this paper, we will also 
show that a slightly modified MCMCF solves the concurrent flow problem, i.e., the 
optimization version of the no-cost multicommodity flow problem, two to twenty 
times faster than Leong et al.’s approximation implementation [LSS93]. 

2 Theoretical Background 

2.1 Definitions 

The minimum-cost multicommodity flow problem consists of a directed network 
G = (y, A), a positive arc capacity function u, a nonnegative arc cost function c, 
and a specification (sj, ti, dfl for each commodity i, i €. {1, 2, . . . , kg}. Nodes Si 
and ti are the source and the sink of commodity i, and a positive number di is 
its demand. 

A flow is a nonnegative arc function /. A flow fl of commodity t is a flow 
obeying conservation constraints and satisfying its demand di. We define the to- 
tal flow /(a) on arc a by /(a) = X]?=i Depending on context, the symbol / 

represents both the (multi)flow (/i, / 2 , • • ., fk) and the total flow /i-l-/ 2 H h/fc, 

summed arc-wise. The cost of a flow / is the dot product c • / = c(a)/(a). 

Given a problem and a flow /, the congestion of arc a is Aa = f{a)/u{a), 
and the congestion of the flow is Aa = max^ Aa. Given a budget B, the cost 
congestion is Ac = c • f/B, and the total congestion is A = max{AA, Ac}. A 
feasible problem instance has a flow / with Aa < 1. 

Our implementation approximately solves the minimum-cost multicommod- 
ity flow problem. Given an accuracy e > 0 and a feasible multicommodity flow 
problem instance, the algorithm finds an e-optimal flow f with e-optimal conges- 
tion, i.e., Aa < (1 + e), and e-optimal cost, i.e., if B* is the minimum cost of any 
feasible flow, f’s cost is at most (1 -F e)B* . Because we can choose e arbitrarily 
small, we can find a solution arbitrarily close to the optimal. 

We combine commodities with the same source nodes to form commodi- 
ties with one source and (possibly) many sinks (see [LSS93,Sch91]). Thus, the 
number k of commodity groups may be smaller than the number fco of simple 
commodities in the input. 



2.2 The Algorithmic Framework 

Our algorithm is mostly based on [KP95b]. Roughly speaking, the approach 
in that paper is as follows. The algorithm first finds an initial flow satisfying 
demands but which may violate capacities and may be too expensive. The al- 
gorithm repeatedly modifies the flow until it becomes 0(e)-optimal. Each it- 
eration, the algorithm first computes the theoretical values for the constant a. 
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and the step size a. It then computes the dual variables yr = where 

r ranges over the arcs A and the arc cost function c, and a potential function 
4>{f) = yr- The algorithm chooses a commodity i to reroute in a round robin 
order, as in [Rad97]. It computes, for that commodity, a minimum-cost flow f* 
in a graph with arc costs related to the gradient V(/)(/) of the potential func- 
tion and arc capacities Xau. The commodity’s flow fi is changed to the convex 
combination (1 — a)fi -I- af*. An appropriate choice of values for a and cr lead 
to 0{e~^nmk) running time (suppressing logarithmic terms). Grigoriadis and 
Khachiyan [GK96] decreased the dependence on e to e“^. 

Since these minimum-cost algorithms compute a multiflow having arc cost at 
most a budget bound B, we use binary search on B to determine an e-optimal 
cost. The arc cost of the initial flow gives the initial lower bound because the 
flow is the union of minimum-cost single-commodity flows with respect to the arc 
cost function c and arc capacities u. Lower bound computations (see Sect. 3.1) 
increase the lower bound and the algorithm decreases the congestion and cost 
until an e-optimal flow is found. 



3 Translating Theory into Practice 

The algorithmic framework described in the previous section is theoretically 
efficient, but a direct implementation requires orders of magnitude larger run- 
ning time than commercial linear-programming packages [GPL95]. Guided by 
the theoretical ideas of [KP95b,LMP+95,PST95], we converted the theoretically 
correct but practically slow implementation to a theoretically correct and practi- 
cally fast implementation. In some cases, we differentiated between theoretically 
equivalent implementation choices that differ in practicality, e.g, see Sect. 3.1. In 
other cases, we used the theory to create heuristics that, in practice, reduce the 
running time, but, in the worst case, do not have an effect on the theoretical 
running time, e.g., see Sect. 3.3. 

To test our implementation, we produced several families of random prob- 
lem instances using three generators, multigrid, rmfgen, and tripartite. 
Our implementation, like most combinatorial algorithms, is sensitive to graph 
structure. MCMCF solves the multigrid problem instances (based on [L091]) 
very quickly, while rmfgen instances (based on [Bad91]) are more difficult. We 
wrote the tripartite generator to produce instances that are especially difficult 
for our implementation to solve. More data will appear in the full paper. Brief 
descriptions of our problem generators and families will appear in [GOPS97]. 



3.1 The Termination Condition 

Theoretically, a small potential function value and a sufficiently large value of the 
constant a indicates the flow is e-optimal [KP95b], but this pessimistic indicator 
leads to poor performance. Instead, we periodically compute the lower bound 
on the optimal congestion A* as found in [LMP+95,PST95]. Since the problem 
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instance is assumed to be feasible, the computation indicates when the current 
guess for the minimum flow cost is too low. 

The weak duality inequalities yield a lower bound. Using the notation from 
[PST95], 



^ C.{Xa)> Y. . (1) 

r comm, i comm, i 

For commodity i, Ci(Ayi) represents the cost of the current flow fi with respect 
to arc capacities Xau and the cost function y^A, where A is the km x m arc 
adjacency matrix together with the arc cost function c. C*{Xa) is the minimum- 
cost flow. For all choices of dual variables and A ^4 > 1, A* > Vr ^ 

Si C'j*(Aa)/ yr- Thus, this ratio serves as a lower bound on the optimal 
congestion A*. 

3.2 Computing the Step Size cr 

While, as suggested by the theory, using a fixed step size a to form the con- 
vex combination (1 — a)fi + af* suffices to reduce the potential function, our 
algorithm computes a to maximize the potential function reduction. Brent’s 
method and similar strategies, e.g., see [LSS93], are natural strategies to max- 
imize the function’s reduction. We implemented Brent’s method [PFTV88], 
but the special structure of the potential function allows us to compute the 
function’s first and second derivatives. Thus, we can use the Newton-Raphson 
method [PFTV88,Rap90], which is faster. 

Given the current flow / and the minimum-cost flow f* for commodity i, the 
potential function (f>{a) is a convex function (with positive second derivative) of 
the step size a. Over the range of possible choices for cr, the potential function’s 
minimum occurs either at the endpoints or at one interior point. Since the func- 
tion is a sum of exponentials, the first and second derivatives 4>'{a) and 
are easy to compute. 

Using the Newton-Raphson method reduces the running time by two orders 
of magnitude compared with using a fixed step size. (See Table 1.) As the accu- 
racy increases, the reduction in running time for the Newton-Raphson method 
increases. As expected, the decrease in the number of minimum-cost flow com- 
putations was even greater. 

3.3 Choosing a 

The algorithm’s performance depends on the value of a. The larger its value, 
the more running time the algorithm requires. Unfortunately, a must be large 
enough to produce an e-optimal flow. Thus, we developed heuristics for slowly 
increasing its value. There are two different theoretical explanations for a than 
can be used to develop two different heuristics. 

Karger and Plotkin [KP95b] choose a so that, when the potential function is 
less than a constant factor of its minimum, the flow is e-optimal. The heuristic 
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Table 1. Computing an (almost) optimal step size reduces the running time and 
number of minimum-cost flow (MCF) computations by two orders of magnitude. 



problem e 


time (seconds) 
Newton fixed ratio 


number of MCFs 
Newton fixed ratio 


rmfgenl 0.12 
rmfgenl 0.06 
rmfgenl 0.03 
rmfgenl 0.01 


830 17220 20.7 

1810 83120 45.9 

5240 279530 53.3 
22930 2346800 102.3 


399 8764 22.0 

912 45023 49.4 

2907 156216 53.7 
13642 1361900 99.8 


rmfgen2 0.01 


3380 650480 192.4 


3577 686427 191.9 


rmfgen3 0.01 


86790 9290550 107.0 


17544 1665483 94.9 


multigridl 0.01 


980 57800 59.0 


1375 75516 54.9 



of starting with a small a and increasing it when the potential function’s value 
became too small experimentally failed to decrease significantly the running 
time. 

Plotkin, Shmoys, and Tardos [PST95] use the weak duality inequalities (1) 
upon which we base a different heuristic. The product of the gaps bounds the 
distance between the potential function and the optimal flow. The algorithm’s 
improvement is proportional to the size of the right gap, and increasing a de- 
creases the left gap’s size. Choosing a too large, however, can impede progress 
because progress is proportional to the step size cr which itself depends on how 
closely the potential function’s linearization approximates its value. Thus, larger 
a reduces the step size. 

Our heuristic attempts to balance the left and right gaps. More precisely, it 
chooses a dynamically to ensure the ratio of inequalities 

J2r y-r/ Z^comm. % ~ ^ 

(Scomm. i Cii^A)/ Z^comm. * {Xa)) ~ 1 

remains balanced. We increase a by factor [3 if the ratio is larger than 0.5 and 
otherwise decrease it by 7 . After limited experimentation, we decided to use the 
golden ratio for both [3 and 7 . The a values are frequently much lower than 
those from [PST95]. Using this heuristic rather than using the theoretical value 
of ln(3m)/e [KP95b] usually decreases the running time by a factor of between 
two and six. See Table 2. 



3.4 Choosing a Commodity to Reroute 

Several strategies for selecting the next commodity to reroute, proposed for con- 
current flows, also apply to the minimum-cost multicommodity flow problem. 
These strategies include weighted [KPST94] and uniform [Gol92] randomiza- 
tion, and round-robin [Rad97] selection. Our experiments, which also included 
adaptive strategies, suggest that round robin works best. 
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Table 2. Adaptively choosing a requires fewer minimum-cost flow (MCF) com- 
putations than using the theoretical, fixed value of a. 



problem e 


number of MCFs 
adaptive fixed ratio 


time (seconds) 
adaptive fixed ratio 


GTE 0.01 


2256 15723 6.97 


1.17 7.60 6.49 


rmfgenS 0.03 


3394 10271 3.03 


19.34 68.30 3.53 


rmfgend 0.10 
rmfgend 0.05 
rmfgend 0.03 


4575 6139 1.34 
6966 22579 3.24 
16287 53006 3.25 


24.65 33.00 1.34 
36.19 117.12 3.24 
80.10 265.21 3.31 


rmfgen5 0.03 


18221 64842 3.56 


140.85 530.03 3.76 


multigrid2 0.01 


1659 1277 0.77 


20.82 23.56 1.13 



3.5 “Restarting” the Minimum- Cost Flow Subroutine 

Theoretically, MCMCF can use any minimum-cost flow subroutine. In practice, the 
repeated evaluation of single-commodity problems with similar arc costs and 
capacities favor an implementation that can take advantage of restarting from a 
previous solution. We show that using a primal network simplex implementation 
allows restarting and thereby reduces the running time by one-third to one-half. 

To solve a single-commodity problem, the primal simplex algorithm repeat- 
edly pivots arcs into and out of a spanning tree until the tree has minimum cost. 
Each pivot maintains the flow’s feasibility and can decrease its cost. The sim- 
plex algorithm can start with any feasible flow and any spanning tree. Since the 
cost and capacity functions do not vary much between MCF calls for the same 
commodity, we can speed up the computation, using the previously-computed 
spanning tree. Using the previously-found minimum-cost flow requires 0{km) 
additional storage. Moreover, it is more frequently unusable because it is in- 
feasible with respect to the capacity constraints than using the current flow. In 
contrast, using the current flow requires no additional storage, this flow is known 
to be feasible, and starting from this flow experimentally requires a very small 
number of pivots. 

Fairly quickly, the number of pivots per MCF iteration becomes very small. 
See Fig. 1. For the 2121-arc rmfgen-d-7-10-040 instance, the average number of 
pivots are 27, 13, and 7 for the three commodities shown. Less than two percent 
of arcs served as pivots. For the 260-arc GTE problem, the average numbers are 
8, 3, and 1, i.e., at most three percent of the arcs. 

Instead of using a commodity’s current flow and its previous spanning tree, 
a minimum-cost flow computation could start from an arbitrary spanning tree 
and flow. On the problem instances we tried, restarting reduces the running time 
by a factor of about 1.3 to 2. See Table 3. Because optimal flows are not unique, 
the number of MCF computations differ, but the difference of usually less than 
five percent. 
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Fig. 1. The cumulative number of pivots as a function of the number of 
minimum-cost flow (MCF) calls for three different commodities in two prob- 
lem instances 

4 Experimental Results 

4.1 Dependence on the Approximation Factor e 

The approximation algorithm MCMCF yields an e-optimal flow. Plotkin, Shmoys, 
and Tardos [PST95] solve the minimum-cost multicommodity flow problem using 
shortest-paths as a basic subroutine. Karger and Plotkin [KP95b] decreased 
the running time by nijn using minimum-cost flow subroutines and adding a 
linear-cost term to the gradient to ensure each flow’s arc cost is bounded. This 
change increases the e-dependence of [PST95] by 1/e to e“^. Grigoriadis and 
Khachiyan [GK96] improved the [KP95b] technique, reducing the e-dependence 
back to e“^. MCMCF implements the linear-cost term, but experimentation showed 
the minimum-cost flows’ arc costs were bounded even without using the linear- 
cost term. Furthermore, running times usually decrease when omitting the term. 

Table 3. Restarting the minimum-cost flow computations using the current flow 
and the previous spanning tree reduces the running time by at least 35%. 



problem instance e 


restarting no restarting 

time (seconds) time (seconds) ratio 


rmfgen-d- 7- 10-020 0.01 


88 


180 


2.06 


rmfgen-d- 7- 10-240 0.01 


437 


825 


1.89 


rmfgen-d-7-12-240 0.01 


613 


993 


1.62 


rmfgen-d- 7- 14-240 0.01 


837 


1396 


1.67 


rmfgen-d-7-16-240 0.01 


1207 


2014 


1.67 


mnltigrid-032-032-128-0080 0.01 


21 


37 


1.77 


mnltigrid-064-064-128-0160 0.01 


275 


801 


2.92 
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The implementation exhibits smaller dependence than the worst-case no- 
cost multicommodity flow dependence of We believe the implementa- 

tion’s searching for an almost-optimal step size and its regularly computing lower 
bounds decreases the dependence. Figure 2 shows the number of minimum-cost 
flow computations as a function of the desired accuracy e. Each line represents 
a problem instance solved with various accuracies. On the log-log scale, a line’s 
slope represents the power of 1/e. For the rmfgen problem instances, the depen- 
dence is about For most multigrid instances, we solved to a maximum 

accuracy of 1% but for five instances, we solved to an accuracy of 0.2%. These 
instances depend very little on the accuracy; MCMCF yields the same flows for 
several different accuracies. Intuitively, the grid networks permit so many differ- 
ent routes to satisfy a commodity that very few commodities need to share the 
same arcs. MCMCF is able to take advantage of these many different routes, while, 
as we will see in Sect. 5, some linear-programming based implementations have 
more difficulty. 



RMFGEN Instances 





Inverse 1/e of Desired Accuracy (log scale) 



Fig. 2. The number of minimum-cost flow (MCF) computations as a function 
of 1/e for RMFGEN instances is and for multigrid instances 



4.2 Dependence on the Number k of Commodity Groups 

The experimental number of minimum-cost flow computations and the running 
time of the implementation match the theoretical upper bounds. Theoretically, 
the algorithm performs 0{e~^k) minimum-cost flow computations, as described 
in Sect. 2.2. These upper bounds (ignoring the e dependence and logarithmic 
dependences) match the natural lower bound where the joint capacity constraints 
are ignored and the problem can be solved using k single-commodity minimum- 
cost flow problems. In practice, the implementation requires at most a linear 
(in fc) number of minimum-cost flows. 






348 Andrew V. Goldberg et al. 



Figure 3 shows the number of minimum-cost flow computations as a function 
of the number k of commodity groups. Each line represents a fixed network with 
various numbers of commodity groups. The multigrid figure shows a depen- 
dence of approximately 25k for two networks. For the rmfgen instances, the 
dependence is initially linear but flattens and even decreases. As the number 
of commodity groups increases, the average demand per commodity decreases 
because the demands are scaled so the instances are feasible in a graph with 
60% of the arc capacities. Furthermore, the randomly distributed sources and 
sinks are more distributed throughout the graph reducing contention for the 
most congested arcs. The number of minimum-cost flows depends more on the 
network’s congestion than on the instance’s size so the lines flatten. 



RMFGEN Instances MULTIGRID Instances 





Fig. 3. The number of minimum-cost flow (MCF) computations as a function 
of the number k of commodity groups for rmfgen and multigrid instances 



5 Comparisons with Other Implementations 

5.1 The Other Implementations: CPLEX and PPRN 

We compared MCMCF (solving to 1% accuracy) with a commercial linear-program- 
ming package CPLEX [CPL95] and the primal partitioning multicommodity flow 
implementation PPRN [CN96]. 

CPLEX (version 4.0.9) yields exact solutions to multicommodity flow linear 
programs. When forming the linear programs, we group the commodities since 
MCMCF computes these groups at run-time. CPLEX’s dual simplex method yields 
a feasible solution only upon completion, while the primal method, in principle, 
could be stopped to yield an approximation. Despite this fact, we compared 
MCMCF with CPLEX’s dual simplex method because it is an order of magnitude 
faster than its primal simplex for the problems we tested. 
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PPRN [CN96] specializes the primal partitioning linear programming tech- 
nique to solve multicommodity problems. The primal partitioning method splits 
the instance’s basis into bases for the commodities and another basis for the joint 
capacity constraints. Network simplex methods then solve each commodity’s sub- 
problem. More general linear-programming matrix computations applied to the 
joint capacity basis combine these subproblems’ solutions to solve the problem. 



5.2 Dependence on the Number k of Commodity Groups 

The combinatorial algorithm MCMCF solves our problem instances two to three 
orders of magnitude faster than the linear-programming-based implementations 
CPLEX and PPRN. Furthermore, its running time depends mostly on the network 
structure and much less on the arc costs’ magnitude. 

We solved several different rmfgen networks (see Fig. 4) with various num- 
bers of commodities and two different arc cost schemes. Even for instances having 
as few as fifty commodities, MCMCF required less running time. Furthermore, its 
dependence on the number k of commodities was much smaller. For the left 
side of Fig. 4, the arc costs were randomly chosen from the range [1,100]. For 
these problems, CPLEX’s running time is roughly quadratic in k, while MCMCF ’s 
is roughly linear. Although for problems with few commodities, CPLEX is some- 
what faster, for larger problems MCMCF is faster by an order of magnitude. PPRN 
is about five times slower than CPLEX for these problems. Changing the cost of 
interframe arcs significantly changes CPLEX’s running time. (See the right side 
of Fig. 4.) Both MCMCF’s and PPRN’s running times decrease slightly. The running 
times’ dependences on k do not change appreciably. 



RMFGEN with Nonzero-Cost Interframe Arcs RMFGEN with Zero-Cost Interframe Arcs 




Fig. 4. The running time in minutes as a function of the number k of commodity 
groups for two different rmfgen networks with twelve and fourteen frames. 
CPLEX’s and PPRN’s dependences are larger than MCMCF’s 
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MCMCF solves MULTIGRID networks two to three orders of magnitude faster 
than CPLEX and PPRN. The left side of Fig. 5 shows MCMCF’s running time using 
a log-log scale for two different networks: the smaller one having 1025 nodes 
and 3072 arcs and the larger one having 4097 nodes and 9152 arcs. CPLEX and 
PPRN required several days to solve the smaller network instances so we omitted 
solving the larger instances. Even for the smallest problem instance, MCMCF is 
eighty times faster than CPLEX, and its dependence on the number of commodities 
is much smaller. PPRN is two to three times slower than CPLEX so we solved only 
very small problem instances using PPRN. 



MULTIGRID Instances Tripartite Instances 





Number k of Commodity Groups (log scale) N um ber of frames 

Fig. 5. The running time in minutes as a function of the number k of commodity 
groups (left figure) and the number of frames (right figure) 



5.3 Dependence on the Network Size 

To test the implementations’ dependences on the problem size, we used tripar- 
tite problem instances with increasing numbers of frames. Each frame has fixed 
size so the number of nodes and arcs is linearly related to the number of frames. 
For these instances, MCMCF’s almost linear dependence on problem size is much 
less than CPLEX’s and PPRN’s dependences. See the right side of Fig. 5. (MCMCF 
solved the problem instances to two-percent accuracy.) As described in Sect. 3.5, 
the minimum-cost flow routine needs only a few pivots before a solution is found. 
(For the sixty-four frame problem, PPRN required 2890 minutes so it was omitted 
from the figure.) 

6 Concluding Remarks 

For the problem classes we studied, MCMCF solved minimum-cost multicommodity 
flow problems significantly faster than state-of-the-art linear-programming-based 
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programs. This is strong evidence the approximate problem is simpler, and that 
combinatorial-based methods, appropriately implemented, should be considered 
for this problem. We believe many of these techniques can be extended to other 
problems solved using the fractional packing and covering framework of [PST95] . 

We conclude with two unanswered questions. Since our implementation never 
needs to use the linear-cost term [KP95b], it is interesting to prove whether the 
term is indeed unnecessary. Also, it is interesting to try to prove the experimental 
(^(g-1.5) dependence of Sect. 4.1. 
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Abstract. We provide several non-approximability results for determin- 
istic scheduling problems whose objective is to minimize the total job 
completion time. Unless V = NV, none of the problems under consid- 
eration can be approximated in polynomial time within arbitrarily good 
precision. Most of our results are derived by Max SNP hardness proofs. 
Among the investigated problems are; scheduling unrelated machines 
with some additional features like job release dates, deadlines and weights, 
scheduling flow shops, and scheduling open shops. 



1 Introduction 

Since the early 1970s, the algorithms and optimization community has put 
lots of efforts into identifying the computational complexity of various combi- 
natorial optimization problems. Nowadays it is common knowledge that, when 
dealing with an AfP-hard optimization problem, one should not expect to find 
a polynomial-time solution algorithm. This insight motivates the search for ap- 
proximation algorithms that output provably good solutions in polynomial time. 
It also immediately raises the question of how well we can approximate a specific 
optimization problem in polynomial time. 

We say that a polynomial-time approximation algorithm for some optimiza- 
tion problem has a performance guarantee or worst-case ratio p, if it outputs a 
feasible solution with cost at most p times the optimum value for all instances of 
the problem; such an algorithm is also called a polynomial-time p-approximation 
algorithm. Now, given an AfP-hard optimization problem, for which values of 
p does there exist and for which values of p does there not exist a polynomial- 
time p-approximation algorithm? In this paper we focus on ‘negative’ results 
for scheduling problems in this area, i.e., we demonstrate for several scheduling 
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R. E. Bixby, E. A. Boyd, and R. Z. Rfos-Mercado (Eds.): IPCO VI 
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problems that they do not have polynomial-time /o-approximation algorithms 
with p arbitrarily close to 1, unless V = NV. 

Until now, the literature only contains a small number of non-approximability 
results for scheduling problems. Most of the known non-approximability results 
have been derived for the objective of minimizing the makespan. As far as we 
know, the only non-approximability results for scheduling problems with minsum 
objective are presented in the papers of Kellerer, Tautenhahn & Woeginger [7] 
and Leonardi & Raz [12]. In the first paper, the authors prove that the problem 
of minimizing total flow time on a single machine subject to release dates cannot 
be approximated in polynomial time within any constant factor. In the second 
one, the authors derive a similar result for the total flow time problem subject 
to release dates on parallel machines. Below, we give a short list with the most 
important results for makespan minimization, all under the assumption that 
V = MV; for more detailed information on the scheduling problems mentioned, 
see the first paragraphs of Sections 3 and 4, or see the survey article by Lawler, 
Lenstra, Rinnooy Kan & Shmoys [8]. We refer the reader interested in more 
results on approximability and non-approximability in scheduling to Lenstra & 
Shmoys [10]. Throughout this paper, we use the notation introduced by Graham, 
Lawler, Lenstra & Rinnooy Kan [4] to denote the scheduling problems. 

— Lenstra & Rinnooy Kan [9] prove that P \ prec, pj = 1\ Cmax (makespan min- 
imization on parallel machines with precedence constraints and unit process- 
ing times) does not have a polynomial-time approximation algorithm with 
performance guarantee strictly better than 4/3. 

— Lenstra, Shmoys & Tardos [11] show that R\ [Cmax (makespan minimiza- 
tion on unrelated machines) does not have a polynomial-time approximation 
algorithm with performance guarantee strictly better than 3/2. 

— Hoogeveen, Lenstra & Veltman [5] prove that P \ prec, c = l,pj = 1 1 Cmax 
and Poo \ prec, c = l,Pj = 1 ] Cmax (two variants of makespan minimization 
on parallel machines with unit processing times and unit communication 
delays) cannot be approximated with worst-case ratios better than 5/4 and 
7/6, respectively. 

— Williamson et al. [16] prove that O \ \ Cmax and F \ \ Cmax (makespan mini- 
mization in open shops and flow shops, respectively) cannot be approximated 
in polynomial time with performance guarantees better than 5/4. 

All the listed non-approximability results have been derived via the so-called gap 
technique, i.e., via A/”7^-hardness reductions that create gaps in the cost function 
of the constructed instances. More precisely, such a reduction transforms the 
YES-instances of some AfP-hard problem into scheduling instances with ob- 
jective value at most c* , and it transforms the NO-instances into scheduling 
instances with objective value at least g ■ c* , where g > I is some fixed real 
number. Then a polynomial-time approximation algorithm for the scheduling 
problem with performance guarantee strictly better than g (i.e., with guaran- 
tee g — s where e > 0) would be able to separate the YES-instances from the 
NO-instances, thus yielding a polynomial-time solution algorithm for an J\fP- 
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complete problem. Consequently, unless V = AfV, the scheduling problem can- 
not have a polynomial-time /o-approximation algorithm with p < g. 

In this paper, we present non-approximability results for the corresponding 
minsum versions of the makespan minimization problems listed above. We prove 
that none of the scheduling problems 

- R\rj\Y. Cj, R\dj\Y. Cj, and i? I I X; WjCj] 

- F\ I EQ andOl I EQ; 

- P \prec,pj = 1 I EQ; 

- P I prec, c = l,pj = 1 1 E Cj and Poo | prec, c = l,pj = 1 | E Cj 

can be approximated in polynomial time within arbitrarily good precision, un- 
less P = MP. Our main contribution is the non-approximability result for 
the problem P | r j | E Cj , which answers an open problem posed by Skutella 
[15]. Interestingly, we do not prove this non-approximability result by apply- 
ing the standard gap technique that we sketched above, but by establishing 
Max SNP-hardness of this problem (see Section 2 for some information on 
Max SNP-hardness). In the Max SNP-hardness proof for P | r j | E Cj we are 
recycling some of the ideas that Lenstra, Shmoys & Tardos [11] used in their 
gap reduction for P | | Cmax- Also the non-approximability results for P | | E Cj 
and O \ I E Cj are established via Max SNP-hardness proofs; part of these ar- 
guments are based on combinatorial considerations that first have been used 
by Williamson et al. [16]. The non-approximability results for P\prec,pj = 

1 I E Cj , for P I prec, c= l,pj = 1 1 for Poo | prec, c= l,pj = 1 1 E Cj 

follow from the well-known gap reductions for the corresponding makespan min- 
imization problems P\prec,pj = 1 1 Cmax, P\prec,c = l,pj = 1 1 Cmax, and 
Poo I prec, c = l,pj = 1 1 Cmax in a straightforward way, and therefore they are 
not described in this paper. 

The paper is organized as follows. In Section 2, we summarize some use- 
ful information on approximation schemes, non-approximability. Max SNP- 
hardness, and P-reductions, as we need it in the remainder of the paper. The 
non-approximability results are presented in Sections 3 and 4: Section 3 deals 
with scheduling unrelated machines and Section 4 considers flow shops and open 
shops. The paper is concluded with a short discussion in Section 5. 

2 Preliminaries on Non-approximability 

This section gives some information on approximation algorithms, and it summa- 
rizes some basic facts on Max SNP-hardness. For a more extensive explanation 
we refer the reader to Papadimitriou & Yannakakis [14]. For a compendium of 
publications on non-approximability results we refer to Crescenzi & Kann [2] . 

A polynomial-time approximation scheme for an optimization problem, PTAS 
for short, is a family of polynomial-time (1 -I- £)-approximation algorithms for 
all e > 0. Essentially, a polynomial-time approximation scheme is the strongest 
possible approximability result for an AfP-haxd problem. For AfP-haxd prob- 
lems an important question is whether such a scheme exists. The main tool for 
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dealing with this question is the L -reduction as introduced by Papadimitriou & 
Yannakakis [14]: 

Definition 1 (Papadimitrion and Yannakakis [14]). Let A and B be two 

optimization problems. An L-reduction from A to B is a pair of functions R 
and S, both computable in polynomial time, with the following two additional 
properties: 

— For any instance I of A with optimum cost Opt(/), R{I) is an instance of 
B with optimum cost OPT(i?(/)), such that 

Opt{R{I)) < a ■ Opt(/), (1) 

for some positive constant a. 

— For any feasible solution s of R{I), S{s) is a feasible solution of I such that 

|Opt(/) - c(5(s))| < ft • |OPT(i?(/)) - c(s)|, (2) 

for some positive constant (3, where c{S{s)) and c(s) represent the costs of 
S{s) and s, respectively. □ 

Papadimitriou & Yannakakis [14] prove that L-reductions in fact are approx- 
imation preserving reductions. If there is an L-reduction from the optimiza- 
tion problem A to problem B with parameters a and j3, and if there exists 
a polynomial-time approximation algorithm for B with performance guarantee 
1 -|- e, then there exists a polynomial-time approximation algorithm for A with 
performance guarantee l-\-a(3e. Consequently, if there exists a PTAS for B, then 
there also exists a PTAS for A. 

Papadimitriou & Yannakakis [14] also define a class of optimization problems 
called Max SNP, and they prove that every problem in this class is approx- 
imable in polynomial time within some constant factor. The class Max SNP is 
closed under L-reductions, and the hardest problems in this class (with respect 
to L-reductions) are the Max SNP-complete ones. A problem that is at least as 
hard (with respect to L-reductions) as a Max SNP-complete problem is called 
Max SNP -hard. For none of these Max SNP-hard problems, a PTAS has been 
constructed. Moreover, if there does exist a PTAS for one Max SNP-hard prob- 
lem, then all Max SNP-hard problems have a PTAS, and 

Proposition 2 (Arora, Lund, Motwani, Sudan, and Szegedy [1]). If 

there exists a PTAS for some Max SNP-hard problem, then V = AfV. □ 

Proposition 2 provides a strong tool for proving the non-approximability of an 
optimization problem X. Just provide an L-reduction from a Max SNP-hard 
problem to X. Then unless V = AfV, problem X cannot have a PTAS. 

3 Unrelated Parallel Machine Schednling 

The unrelated parallel machine scheduling problem considered in this section 
is defined as follows. There are n independent jobs 1, 2, . . . , n that have to be 
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scheduled on m machines Mi, M 2 , . . . , Mm- Preemptions are not allowed. Every 
machine can only process one job at a time and every job has to be processed on 
exactly one machine. If job j is scheduled on machine Mi, then the processing 
time required is pij. Every job j has a release date Vj on which it becomes avail- 
able for processing. The objective is to minimize the total job completion time. 
In the standard scheduling notation this problem is denoted by i? | r j | ^ Cj. 

We prove the non-approximability of i? | | ^ Cj by presenting an L-reduc- 

tion from a Max SNP-hard 3-dimensional matching problem to R\vj\ ^Cj. 
This L-reduction draws some ideas from the gap reduction of Lenstra, Shmoys & 
Tardos [11] for R\ | Cmax- Consider the following Maximum Bounded 3-Dimen- 
sional Matching problem (MAX-3DM-B), which has been proven to be Max 
SNP- hard by Kann [6]. 

MAXIMUM BOUNDED 3-DIMENSIONAL MATCHING (Max-3DM-B) 

Input: Three sets A = {ai, . . . , Oq}, B = {bi , . . . , bq} and C = {ci, . . . , Cq}. A 
subset T of A X B X C of cardinality s, such that any element of A, B and 
C occurs in exactly one, two, or three triples in T. Note that this implies that 
q < s <3q. 

Goal: Find a subset T' of T of maximum cardinality such that no two triples of 
T' agree in any coordinate. 

Measure: The cardinality of T' . 

The following simple observation will be useful. 

Lemma 3. For any instance I o/Max-3DM-B, we have Opt(J) > ^s. □ 

Now let I = (q,T) be an instance of MAX-3DM-B. We construct an instance 
R{I) of i? I Tj I ^ Cj with 3q+ s jobs and s + q machines. The first s machines 
correspond to the triples in T, and hence are called the triple machines. The 
remaining q machines are the dummy machines. The 3g-|-s jobs are divided into 
3g element jobs and into s dummy jobs. The 3q element jobs correspond to the 
elements of A, B, and C, and are called A-jobs, B-jobs, and C-jobs, respectively. 

Let Ti = (ai,bj,Ck) be the ?th triple in T. The processing times on the 
Ith triple machine are now defined as follows. The processing time of the three 
element jobs that correspond to the elements Oj, bj, and Ck is 1, whereas all 
the other element jobs have processing time infinity. The dummy jobs have 
processing time 3 on any triple machine and also on any dummy machine. All 
A-jobs and B-jobs have processing time 1 on the dummy machines, and all C- 
jobs have infinite processing time on the dummy machines. The release dates 
of all A-jobs and all dummy jobs are 0, all B-jobs have release date 1, and all 
C-jobs have release date 2. 

In the following we are mainly interested in schedules of the following struc- 
ture. Among the triple machines, there are k machines that process the three 
element jobs that belong to the triple corresponding to the machine (we call such 
machines good machines), there are q — k machines that process a dummy job 
together with a C-job, and there are s — q machines that process a single dummy 
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job. The q dummy machines are split into two groups: q — k of them process an 
yl-job and a i?-job, and k of them process a single dummy job. An illustration for 
schedules of this structure is given in Figure 1. All jobs are scheduled as early as 
possible. The cost Zk of a schedule cr with the special structure described above 



q-k 



s-q 



s 

>. 

a 

a 
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Fig. 1. A schedule for R{I) with a special structure. 



only depends on the number k of good machines, and it is equal to 

Zk = Cb + Cd 

A— jobs a B—jobs b C—jobs c dummy jobs d 

= g + 2<7 + 3fc + 4(<7 — fc) + 3s 
= 7q — k is. 



Lemma 4. Let a be any feasible schedule for R{I) with k good machines. Then 
the objective value of a is at least equal to Zk, and there exists a feasible sched- 
ule a' satisfying the structure described above with the same number k of good 
machines. 

Proof. We argue that the objective value of schedule a is at least Zk; then cr 
can be replaced by a schedule a' with the desired structure, which is readily 
determined given the k good machines. Note that a schedule in which every job 
starts at its release date has objective value equal to 6q + 3s. We prove that the 
extra cost caused by jobs that do not start at their release date is at least q — k. 
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Let , and be the number of ^-jobs, B-jobs, and C-jobs, respectively, 

that do not start at their release dates. Let be the number of dummy jobs 
that start at time 1, and let be the number of dummy jobs that start at 
or after time 2. Finally, let k\ and ^2 be the number of machines that are idle 
during the time intervals [0, 1] and [1, 2], respectively. There are q — of the C- 
jobs whose processing starts at their release date. All these C-jobs are processed 
on triple machines, and only k triple machines are good machines. Hence, the 
remaining number of at least q — i^ — k machines that process such a (7-job must 
be idle during the time interval [0, 1], or during [1,2], or during both intervals. 
This yields that 



ki + k 2 > q — — k. (3) 

The number of jobs that have release date zero equals the number of machines. 
Hence, the processing of at least ki of the A-jobs and dummy jobs does not start 
at time zero, and we conclude that + £2 ^ ^1 • Analogously at least ^2 

of the H-jobs and dummy jobs are not processed during the time interval [1,2], 
and therefore + £2 ^ ^2 holds. Summarizing, one gets that the additional 
cost caused by jobs in a that do not start at their release date is at least 

£^ + £^ + £^ + £^ + 2 £^ = {£^ + £^ + £^) + {£^ + £^) + £^ 

> ki + k 2 + £^ > q — £^ — k + £^ = q — k. 

Hereby the proof of the lemma is complete. □ 

Lemma 5. For any instance I of Max-SDM-B, we have that OPT(i?(/)) < 
69 0pt(/). Hence, the polynomial-time transformation R fulfills condition (1) 
of Definition 1 . 

Proof. Let k = Opt(/). The statement of Lemma 4 yields that OPT(i?(/)) = 
7q — k 3s, and Lemma 3 yields that k > js > ^q. Hence, OPT(i?(/)) < 
49/c-/c-h21A: = 69 0PT(/). □ 

Next, we define a polynomial-time transformation S that maps feasible solu- 
tions of R{I) to feasible solutions of I. Let ct be a feasible schedule for R{I). As 
described in the proof of Lemma 4, we find a corresponding schedule a' for a that 
has the special structure. Then the feasible solution S{a) for the instance / of 
MAX-3DM-B consists of the triples in T that correspond to the good machines 
in (j7 

Lemma 6. For any feasible schedule a of R{I), the feasible solution S{a) of 
instance I fulfills the inequality |Opt(/) — c(S'((t))| < |OPT(i?(/)) — c(cr)|. Hence, 
the polynomial-time transformation S fulfills condition (2) of Definition 1. 

Proof. The statement of Lemma 4 implies that if there exists a schedule of cost 
7q-\-3s—k' for R{I), then there exists a solution T' for / with cardinality |T'j = k' . 
Let k = Opt(/). Then ]Opt(/) — c(S'((j))| = k — k' and |OPT(i?(/)) — c(cr)| = 
|7(7 -I- 3s — A: — (7(7 -I- 3s — fc')|. □ 
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Summarizing, Lemma 5 and Lemma 6 state that the transformations R and S 
satisfy both conditions in Definition 1, and hence constitute a valid L-reduction. 
We formulate the following theorem. 

Theorem 7. The scheduling problem R\rj\ ^Cj is Max SNP-hard and thus 
does not has a PTAS, unless V = NV . □ 

The same idea works to prove Max SNP-hardness of R\dj\ ^ Cj (scheduling 
unrelated machines with job deadlines) and R\ \ ^ ruj Cj (minimizing the total 
weighted completion time). 

4 Shop Scheduling Problems 

The shop scheduling problems considered in this section are defined as follows. 
There are n jobs 1, 2, . . . , n and m machines Mi, M 2 , . . . , Mm- Each job j consists 
of m operations Oij , 02j , • ■ • , Omj ■ Operation Oij has to be processed on Mi for 
a period of pij units. No machine may process more than one job at a time, 
and no two operations of the same job may be processed at the same time. 
Preemption is not allowed. 

There are three types of shop models. First, there is the open shop in which 
it is immaterial in what order the operations are executed. Secondly, there is the 
job shop in which the processing order of the operations is prespecified for each 
job; different jobs may have different processing orders. Finally, there is the flow 
shop, which is a special case of the job shop: here, the prespecified processing 
order is the same for all jobs. In the standard scheduling notation, the open shop, 
job shop, and flow shop with total job completion time objective are denoted by 
0\ I X) Q > I I X Q > F I I '^Cj, respectively. 

In this section, we provide a proof for the following theorem. 

Theorem 8. The scheduling problems 0\ \ ^Cj, J\ \ '^Cj, and F| | XQ 
are Max SNP -hard. Unless V = NV , they do not have a PTAS. 

Williamson et al. [16] show by a gap reduction from a variant of 3-satisfiability 
that the makespan minimization shop problems J | | Cmax and F \ \ Cmax cannot 
be approximated in polynomial time within a factor better than 5/4 (unless 
V = MV). In proving Theorem 8, we use an L-reduction that is based on this 
gap reduction. Our L-reduction is from the following version Max-2Sat-B of 
maximum 2-satisfiability. 

MAXIMUM BOUNDED 2-SATISFIABILITY (Max-2Sat-B) 

Input: A set [/ = {x \, . . . , Xq\ of variables and a collection C = {ci, . . . , Cg} of 
clauses over U. Each clause consists of exactly two (possibly identical) literals. 
Each variable occurs either once or twice in negated form in C , and either once 
or twice in unnegated form in C. 

Goal: Find a truth assignment for U such that a maximum number of clauses 
is satisfied. 

Measure: The number of satisfied clauses. 
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Proposition 9. The above defined problem Max-2Sat-B is Max SNP -hard. 

□ 

In analogy to Lemma 3, we argue that for any instance I of Max-2Sat-B, the 
optimum value is bounded away from zero. 

Lemma 10. For any instance I o/ M ax-2Sat-B, we have Opt(/) > ^s. □ 



4.1 Flow Shop Scheduling 

Let / = ([/, C) with U = {x \, . . . , Xq} and C = {c\, . . . , c^} be an instance of 
Max-2Sat-B. We define a flow shop instance R{I) as follows. 

In constructing this instance R{I), we distinguish between the first and the 
second unnegated (respectively, negated) occurrence of each literal. For j = 1,2, 
we refer to the jth occurrence of the literal Xi as Xij, and to the jth occurrence of 
Xi as Xij. With each variable Xi we associate four corresponding so-called variable 
jobs: the two jobs xn and Xi2 correspond to the first and second occurrence 
of literal Xi, and the two jobs xn and Xi2 correspond to the first and second 
occurrence of the literal Xi. With each literal x G {xi 2 ,Xi 2 } that does not occur 
in the formula, we further associate three so-called dummy jobs di{x), d 2 {x), and 
ds^x). The processing time of every operation is either 0 or 1. Every variable job 
X has exactly three operations with processing time 1 and m — 3 operations of 
processing time 0. The three length 1 operations are a beginning operation B(x), 
a middle operation M{x) and an ending operation E{x). A dummy job di{x) 
only has a single operation Di{x) with processing time 1, and to — 1 operations 
of processing time 0. There are four classes of machines. 

— For each variable Xj, there are two assignment machines: the first one pro- 
cesses the operations B{xj\) and Bi^ji)^ whereas the second one processes 
the operations B{xj 2 ) and B(xj 2 ). 

— For each variable xj, there are two consistency machines: the first one pro- 
cesses the operations M{xji) and M{xj 2 ), whereas the second one processes 
the operations M{xj 2 ) and M{xji). 

— For each clause aV6, there is a clause machine that processes E{a) and E{b). 

— For each literal x G {xi 2 ,Xi 2 } that does not occur in the formula, there 
is a garbage machine that processes the operation E{x) and the dummy 
operations Di{x), £> 2 ( 2 ;), and D^{x). 

The processing of every job first goes through the assignment machines, then 
through the consistency machines, then through the clause machines, and finally 
through the garbage machines. Since most operations have length 0, the precise 
processing order within every machine class is not essential; we only note that 
for every variable job, processing on the first assignment (first consistency) ma- 
chine always precedes processing on the second assignment (second consistency) 
machine. Similarly as in Section 3, we are mainly interested in schedules for 
R{I) with a special combinatorial structure. In a so-called consistent schedule, 
for every variable x either both operations B(xn) and B(xi 2 ) are processed in 
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Fig. 2. A consistent schedule for the flow shop instance. 



the interval [0,1], or both operations B(xn) and B(xi 2 ) are processed during 
[0, 1]. Moreover, in a consistent schedule the machines process the operations of 
length 1 during the following intervals. The assignment machines are only pro- 
cessing length 1 operations during [0,2], and the consistency machines are only 
processing such operations during [1, 3]. On every clause machine, the operations 
of length 1 are either processed during [2,3] and [3,4] or during [3,4] and [4,5]. 
On the garbage machines, all four operations of length 1 are processed during 
[0,4]; there are no restrictions on the processing order on the garbage machines. 
Figure 2 gives an illustration of a consistent schedule. 

Lemma 11. Let a be any feasible schedule for R{I). Then there exists a feasible 
consistent schedule a' whose objective value is at most the objective value of a. 

Proof. Let cr be an arbitrary feasible schedule for R{I). Throughout the proof, 
we only deal with the placement of length 1 operations (as placing the length 0 
operations is straightforward); a machine is busy if it is processing a length 1 
operation. Our first goal is to transform cr into a schedule in which the assign- 
ment machines are only busy during [0, 2] and the consistency machines are only 
busy during [1,3]. We start by shifting all operations on the assignment and 
consistency machines as far to the left as possible without violating feasibility. 
Clearly, in the resulting schedule all operations on the assignment machines are 
processed during [0,2]. 

Now suppose that on some consistency machine some operation, say opera- 
tion M{xii), only completes at time four. Since all operations were shifted to the 
left, this yields that both operations B{xn) and B{xi 2 ) are scheduled during the 
time interval [1,2]. Moreover, operations B{xn) and B{xi 2 ) are both processed 
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during [0,1]. We proceed as follows: If operation M{xn) is processed before 
M{xi 2 ), then we switch the order of the operations B{xi 2 ) and B{xi 2 ) on their 
assignment machine. Moreover, we reschedule operation M^xa) during [1, 2] and 
operation M{xn) during [2,3]. If operation M{xn) is processed after M{xi 2 ), 
then we perform a symmetric switching and rescheduling step. Note that after 
these switches, the schedule is still feasible and that on the consistency machines 
no operation has been shifted to the right. 

By performing such switches, we eventually get a schedule in which all as- 
signment machines are busy during [0 , 2] and all consistency machines are busy 
during [1,3]. Finally, by shifting all operations on the clause and garbage ma- 
chines as far to the left as possible, we obtain a schedule a' . It is routine to check 
that in schedule a' every clause machine is either busy during [2, 4] or during 
[3,5], and every garbage machine is busy during [0,4]. Since in schedule a' no 
job finishes later than in schedule cr, the objective value has not been increased. 

It remains to be proven that the constructed schedule o' indeed is a consistent 
schedule, i.e., to prove that for every variable Xi^ the operations B{xn) and 
B{xi 2 ) are processed simultaneously. First, assume that operation B{xn) starts 
at time 0. This implies that B{^n) is processed during [1,2], and that M(xii) 
is processed during [2, 3]. This in turn implies that M{xi 2 ) starts at time 1, and 
thus B{xi 2 ) is processed during [0, 1], and we are done. In case operation B{xn) 
starts at time 1, a symmetric argument works. □ 

Next, we define a polynomial-time transformation S that maps feasible so- 
lutions of R{I) to feasible solutions of the Max-2Sat-B instance I. Let a be 
a feasible schedule for R{I). As described in the proof of Lemma 11, we find a 
corresponding consistent schedule a' . We define the truth setting S'(cr) for I as 
follows. If in a' operations B{xi\) and B{xi 2 ) are processed during [0,1], then 
variable Xi is set to TRUE, and if B{xn) and B{xi 2 ) are processed during [1,2], 
then Xi is set to FALSE. 

It can be verified that if a clause is satisfied under the truth setting S{a), 
then the length 1 operations on the corresponding clause machine are processed 
during [2, 3] and [3,4]. Conversely if a clause is not satisfied, then these operations 
occupy the intervals [3,4] and [4,5]. 

Lemma 12. For any instance I o/ M ax-2Sat-B, we have that OPT(i?(/)) < 
58 0 pt(/). Hence, the polynomial-time transformation R fulfills condition (1) of 
Definition 1. 

Proof. There are s clause machines and at most 2q garbage machines. Since 
the total completion time of two jobs on the same clause machine is at most 
4 4-5 = 9, the total completion time of the variable jobs that have non-zero 
processing requirement on some clause machine is at most 9s. Moreover, the total 
completion time of the remaining jobs (i.e., the jobs with non-zero processing 
time on some garbage machine) is at most (1 -I- 2 -|- 3 -I- 4)2<7 < 20s. Hence 
OPT(i?(/)) < 29s, and Lemma 10 completes the proof. □ 
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Lemma 13. For any feasible schedule a of R{I), the feasible solution S{a) of in- 
stance I fulfills the inequality |Opt(/) — c(S'((t))| < ||Opt(_R(/)) — c(cr)|. Hence, 
the polynomial-time transformation S fulfills condition (2) of Definition 1. 

Proof. As has been argued before, the length 1 operations on a clause machine 
have completion times 3 and 4, if the corresponding clause is satisfied, and they 
have completion times 4 and 5, if the clause is not satisfied. In other words, every 
unsatisfied clause induces an extra cost of 2 in the objective function. The claim 
follows. □ 

Lemma 12 and 13 state that the transformations R and S satisfy the condi- 
tions in Definition 1. Since both transformations are computable in polynomial 
time, the problem F\ \ ^Cj is Max SNP-hard. This completes the proof of 
the flow shop and of the job shop part of Theorem 8. 



4.2 Open Shop Scheduling 

The essential difference between the flow shop problem and the open shop prob- 
lem is that the order in which the operations belonging to the same job are 
processed is no longer given; therefore, we must look for a different way to en- 
force that the beginning operation indeed precedes the middle operation, which 
in turn must precede the ending operation. To this end, we introduce a number 
of additional jobs, which are used to All the interval [0, 1] on the consistency ma- 
chines and the interval [0, 2] on the clause machines; these additional jobs can 
be forced to go there, because our objective is to minimize the total completion 
time, which favors small jobs. We further need some more jobs, which are used 
to remove unnecessary idle time. This can be worked out as follows. 

Similar to the flow shop we start from an instance I = {U, C) of Max-2Sat-B. 
We introduce the same set of variable jobs, dummy jobs, assignment machines, 
consistency machines, clause machines, and garbage machines. Additionally, the 
instance R{I) contains 26q-\-6s so-called structure jobs. Every structure job con- 
sists of m — 1 operations of length 0 and of a single operation of non-zero length; 
this operation is called the structure operation corresponding to the structure 
job. 



— On each of the 2q assignment machines, we introduce five structure oper- 
ations of length 3. Because of their large processing time, any reasonable 
schedule processes them during the interval [2, 17]. 

— On each of the 2q consistency machines, we introduce three structure oper- 
ations of length i and five structure operations of length 3. Any reasonable 
schedule processes the operations of length i during [0, 1] and the operations 
of length 3 during [3, 18]. 

— On each of the s clause machines, we introduce six structure operations of 
length i. Any reasonable schedule processes them during [0,2]. 

It can be shown that for any feasible schedule for R{I), there exists a reasonable 
consistent schedule with non-larger objective value. With this, one can define a 
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truth setting S{a) like in Subsection 4.1. Again, the constructed transformations 
are polynomial time computable and fulfill the conditions in Definition 1 . Hence, 
problem O | | X) i® Max SNP-hard. This completes the proof of Theorem 8. 

5 Conclusions 

In this paper, we have derived a number of non-approximability results for 
scheduling problems with total job completion time objective. The approxima- 
bility status of most scheduling problems with this objective function or its 
weighted counterpart remains amazingly unclear: until today, all that we know 
is that some of these problems can be solved in polynomial time by straightfor- 
ward algorithms (like the problems 1 | | X and P | | X C, ) and that some 
of these problems do not have a PTAS (like the problems investigated in this 
paper). However, there is not a single strongly AfP-hard scheduling problem 
with minsum objective for which a PTAS has been constructed. We state the 
following conjectures. 

Conjecture 14. The problems 1 1 r ^ | X C/ (scheduling a single machine with 
job release dates) and P\ \ '^WjCj (scheduling parallel identical machines with 
the objective of minimizing the total weighted job completion time) both do have 
a PTAS. 

Conjecture 15. Neither of the problems P\rj \ X Q,- (scheduling parallel iden- 
tical machines with job release dates) and l|prec| X Cj (scheduling a single 
machine with precedence constraints) has a PTAS. 
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Abstract. A well studied and difficult class of scheduling problems con- 
cerns parallel machines and precedence constraints. In order to model 
more realistic situations, we consider precedence delays, associating with 
each precedence constraint a certain amount of time which must elapse 
between the completion and start times of the corresponding jobs. Re- 
lease dates, among others, may be modeled in this fashion. We provide 
the first constant-factor approximation algorithms for the makespan and 
the total weighted completion time objectives in this general class of 
problems. These algorithms are rather simple and practical forms of list 
scheduling. Our analysis also unifies and simplifies that of a number 
of special cases heretofore separately studied, while actually improving 
some of the former approximation results. 



1 Introduction 

Scheduling problems involving precedence constraints are among the most dif- 
ficult problems in the area of machine scheduling, in particular for the design 
of good approximation algorithms. Our understanding of the structure of these 
problems and our ability to generate near-optimal solutions remain limited. The 
following examples illustrate this point, (i) The first approximation algorithm 
for P|prec|Cniax by Graham [14] is not only more than thirty years old, but it is 
also still essentially the best one available for this problem. On the other hand, 
it is only known that no polynomial-time algorithm can have a better approx- 
imation ratio than 4/3 unless P = NP [23]. (ii) The computational complexity 
of the problem Fm\pj = 1, prec|Cniax, open problem 8 from the original list of 
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Garey and Johnson [11] is still open, (iii) The situation is also unsatisfactory with 
machines running at different speed, for which no constant-factor approximation 
algorithms are known. For the makespan objective, Chudak and Shmoys [10] only 
recently improved to O(logm) an almost twenty year old approximation ratio of 
0{^/m) due to Jaffe [20]. They obtained the same approximation ratio for the 
total weighted completion time objective, (iv) Progress is also quite recent for 
the latter objective on a single machine or identical parallel machines. Until re- 
cently, no constant-factor approximation algorithms were known. Lately, a better 
understanding of linear programming relaxations and their use to guide solution 
strategies led to a 2- and a 2.719-approximation algorithm for l|prec| ^wjCj 
and l|rj,prec| '^WjCj, respectively [16,34,35], and to a 5.328-approximation al- 
gorithm for P|rj, prec| ^ wjCj [7]. Few deep negative results are known for these 
problems (see [18] for the total completion times objective). 

In this paper, we consider (a generalization of) the scheduling problem 
P|rj,prec| ^WjCj and answer a question of Hall et al. [16, Page 530]: 

“Unfortunately, we do not know how to prove a good performance guar- 
antee for this model by using a simple list-scheduling variant.” 

Indeed their algorithm, as well as its improvement by Chakrabarti et al. [7], 
is rather elaborate and its performance ratio does not match the quality of the 
lower bound it uses. We show that using the same LP relaxation in a different 
way (reading the list order from the LP midpoints instead of LP completion 
times) yields a simple 4-approximation algorithm for P|rj, prec| ^ WjCj. We 
actually obtain this result in the more general framework of precedence delays. 

We consider a general class of precedence-constrained scheduling problems 
on identical parallel machines. We have a set N of n jobs and m identical par- 
allel machines. Each job j has a nonnegative processing requirement (size) pj 
and must be processed for that amount of time on any one of the machines. A 
job must be processed in an uninterrupted fashion, and a machine can process 
only one job at a time. We are interested in constrained scheduling problems 
in which each job j may have a release date rj before which it cannot be pro- 
cessed, and there may be a partial order A on the jobs. We associate with each 
precedence-constrained job pair (i,j) G A a, nonnegative precedence delay dij, 
with the following meaning: in every feasible schedule, job j cannot start until 
dij time units after job i is completed. Special cases include ordinary precedence 
constraints {dij = 0); and release dates Vj > 0 (which may be modeled by adding 
a dummy job 0 with zero processing time and precedence delays doj = Vj for 
all other jobs). Delivery times (or lags), which must elapse between the end of a 
job’s processing and its actual completion time, may also be modeled by adding 
one or several dummy jobs and the corresponding precedence delays. 

We denote the completion time of a job j in a schedule S as Cj and will 
drop the superscript S when it is clear to which schedule we refer. We con- 
sider the usual objectives of minimizing the makespan Cmax = max^ Cj and, 
for given nonnegative weights Wj > 0, a weighted sum WjCj of completion 
times. In an extension of the common notation introduced in [15], we may de- 
note these problems as P|prec. delays dylCmax and P|prec. delays dij \ ^WjCj, 
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respectively. These problems are NP-hard (see, e.g., Lawler et al. [22]), and we 
discuss here the quality of relaxations and approximation algorithms. An a- 
approximation algorithm is a polynomial-time algorithm that delivers a solution 
with objective value at most a times the optimal value. Sometimes a is called 
the (worst-case) performance guarantee or ratio of the algorithm. 

Precedence delays were considered for resource-constrained project schedul- 
ing under the name “finish-to-start lags”, e.g., by Bartusch, Mohring, and Rader- 
macher [4] and Herroelen and Demeulemeester [17], for one-machine scheduling 
by Wikum, Llewellyn, and Nemhauser [37] under the name “generalized prece- 
dence constraints”, and by Balas, Lenstra, and Vazacopoulos [3] under that of 
“delayed precedence constraints” ; the latter authors use the Lmax minimization 
problem as a key relaxation in a modified version of the shifting bottleneck 
procedure for the classic job-shop scheduling problem. Most of the theoretical 
studies concerning this class of precedence constraints consider the one ma- 
chine problem l|prec. delays dij = k,pj = l|Cniax which corresponds to a basic 
pipeline scheduling problem (see [21] for a survey). Leung, Vornberger, and Wit- 
thoff [24] showed that this problem is NP-complete. Several other authors (e.g., 
[6,5,28]) obtained polynomial-time algorithms for particular instances by utiliz- 
ing well-known algorithms for special cases of the classical m-machine problem. 
In the context of approximation algorithms, the main result is that Graham’s list 
scheduling algorithm [14] was extended to P|prec. delays dij = k,pj = l|Cmax 
to give a worst-case performance ratio of 2 — l/{m{k + 1)) [21,28]. We extend 
this result, in Section 3, to nonidentical precedence delays and processing times. 

List scheduling algorithms, first analyzed by Graham [14] are among the 
simplest and most commonly used approximate solution methods for parallel 
machine scheduling problems. These algorithms use priority rules, or job rank- 
ings, which are often derived from solutions to relaxed versions of the problems. 
For example, several algorithms of Hall, Schulz, Shmoys, and Wein [16] use the 
job completion times obtained from linear programming relaxations. In Section 2 
we show that a modified, job-driven version of list scheduling based on job com- 
pletion times can, in the presence of precedence constraints, lead to solutions 
that are about as bad as m times the optimum, for both the Cmax and WjCj 
objectives; this behavior may also occur when using actual optimum completion 
times. 

Graham’s original list scheduling, however, works well for minimizing the 
makespan as we show in Section 3 by extending it to the case of precedence de- 
lays. For minimizing a weighted sum WjCj of completion times, we present in 
Section 4 a new algorithm with approximation ratio bounded by 4 for the general 
problem with precedence delays. This algorithm is based on an LP relaxation 
of this problem, which is a straightforward extension of earlier LP relaxations 
proposed by Hall et al. [16]. The decision variables in this relaxation are the com- 
pletion times Cj of every job j, so we choose to ignore the machine assignments 
in these relaxations. There are two sets of linear constraints, one representing 
the precedence delays (and, through the use of a dummy job, the release dates) 
in a straightforward fashion; the other set of constraints is a relatively simple 
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way of enforcing the total capacity of the m machines. Although the machine 
assignments are ignored and the machine capacities are modeled in a simplistic 
way, this is sufficient to obtain the best relaxation and approximation bounds 
known so far for these problems and several special cases thereof. We show that 
using midpoints derived from the LP relaxation leads to a performance ratio 
bounded by 4 for the general problem described above. Recall that in a given 
schedule the midpoint of a job is the earliest point in time at which half of its 
processing has been performed; that is, if the schedule is (or may be considered 
as) nonpreemptive then the midpoint of job j is simply — pj /2 where is 

its completion time in the relaxation R. The advantage of using midpoints in the 
analysis of approximation algorithms was first observed by Goemans [12] and 
has since then been used by several authors (e.g., [35,36,13]). Our result seems 
to be the first, however, where midpoints are really needed within the algorithm 
itself. We also show how the analysis yields tighter bounds for some special cases, 
and then conclude with some additional remarks in Section 5. We believe that 
the approach of applying a list-scheduling rule in which the jobs are ordered 
based on their midpoints in an LP solution will have further consequences for 
the design of approximation algorithms. 

In summary, the main contributions of this paper are as follows. 

1. We clarify the relationship between two forms of List Scheduling Algorithms 
(LSAs): Graham’s non-idling LSAs and job-based LSAs. In particular, it is 
shown that the former are appropriate for optimizing objectives, such as the 
makespan Cmax, that are related to maximizing machine utilization, whereas 
they are inappropriate (leading to unbounded performance ratio) for job 
oriented objectives, such as the weighted sum of completion times WjCj. 
In contrast, we present job-based LSAs with bounded performance ratio for 
the latter objective. 

2. We show that using jo& completion times as a basis for job-based list schedul- 
ing may yield very poor schedules for problems with parallel machines, prece- 
dence constraints and weighted sum of completion times objective. This may 
happen even if the completion times are those of an optimal schedule. 

3. In contrast, we show that job-based list scheduling according to job midpoints 
from an appropriate LP relaxation leads to job-by-job error ratios of at most 
4 for a broad class of problems. 

4. We present a general model of scheduling with precedence delays. This also 
allows us to treat in a unified framework ordinary precedence constraints, 
release dates and delivery times. In particular, this simplifies and unifies the 
analysis and proof techniques. 

5. Finally, we present the best polynomial-time approximation bounds known 
so far for a broad class of parallel machine scheduling problems with prece- 
dence constraints or delays (including release dates and delivery times) and 
either a makespan or total weighted completion time objective. These bounds 
are obtained by using relatively simple LSAs which should be of practical 
as well as theoretical interest. We also present the best polynomially solv- 
able relaxations known so far for such problems with the latter objective. 
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The approximation results are summarized in Table 1 where the parameter 
p is defined a,s p = max(j min^gAr pi, and m denotes the number of 
identical parallel machines. 



Table 1. Summary of results. 



Problem 


New bound 


Best earlier bound 


Ijprec. delays d^jCmax 


2 3— 

i+p 


? 


Pjprec. delays dylCmax 


o 1 

^ m{l+p) 


? 


P prec. delays dy X 


4 


? 


P|G,Prec| 


4 


5.328 Ghakrabarti et al. [7j 


P prec X 


4_ A 
m 


5.328 Ghakrabarti et al. [7j 


Ijprec. delays dij \ ^WjCj 


3 


? 



Due to space limitations some details are omitted from this paper. They can 
be found in the complete version, see [27]. 

2 List Scheduling Algorithms 

In his seminal paper, Graham (1966) showed that a simple list-scheduling rule 
is a (2 — ^)-approximation algorithm for PjprecjCniax- In this algorithm, the 
jobs are ordered in some list, and whenever one of the m machines becomes 
idle, the next available job on the list is started on that machine, where a job 
is available if all of its predecessors have completed processing. By their non- 
idling property, Graham’s List Scheduling Algorithms (GLSAs) are well suited 
when machine utilization is an important consideration. Indeed, it is shown in 
Section 3 that, for the makespan minimization problem Pjprec. delays dyjC'max, 
any GLSA (i.e., no matter which list of jobs is used) produces a schedule with 
objective function value within a factor 2 of the optimum. In this case, a job is 
available if all its predecessors are completed and the corresponding precedence 
delays have elapsed. 

In contrast, the elementary Example 1 shows that the non-idling property 
may lead to an arbitrarily poor performance ratio for a weighted sum of com- 
pletion times objective ^j^j- 

Example 1. Gonsider the following two-job instance of the single machine non- 
preemptive scheduling problem Ijrjj ^WjCj (a special case of a precedence de- 
lay problem, as discussed in the introduction). For a parameter q >2, job 1 has 
Pi = q, Ti = 0 and wi = 1, whereas job 2 has P2 = 1, ?’2 = 1 and W2 = q^- 
The optimum schedule is to leave the machine idle during the time interval [0, 1) 
so as to process job 2 first. The optimum objective value is 2q^ + {q + 2). Any 
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non-idling heuristic starts processing job 1 at time 0, leading to an objective 
value at least + q, and its performance ratio is unbounded as q may be 

arbitrarily large. □ 

A different example of the same type but using ordinary precedence con- 
straints rather than release dates can be found in [33, Page 82, Ex. 2.20]. Thus 
to obtain a bounded performance for the weighted sum of completion times ob- 
jective WjCj, we must relax the non-idleness property. One strategy, leading 
to job-based nonpreemptive list scheduling algorithms, is to consider the jobs one 
by one, in the given list order, starting from an empty schedule. Each job is non- 
preemptively inserted in the current schedule without altering the jobs already 
scheduled. Specific list scheduling algorithms differ in how this principle is im- 
plemented, in particular, for parallel machines, regarding the assignment of the 
jobs to the machines. For definiteness, consider the following version, whereby 
every job is considered in the list order and is scheduled at the earliest feasible 
time at the end of the current schedule on a machine. Notice that the given 
list is assumed to be a linear extension of the poset defined by the precedence 
constraints. 

Job-Based List Scheduling Algorithm for Pjprec. delays dij\- 

1. The list L = (f(l),f(2), . . . ,£(n)) is given. 

2. Initially all machines are empty, with machine completion times TK = 0 for 
all = 1, . . . , TO. 

3. For fc = 1 to n do 

3.1 Let job j = £{k). Its start time is 

Sj = max(max{Ci -I- dij : (i,j) G A}, min{I)j : h = 1,. .. ,m}) 
and its completion time is Cj = Sj + pj . 

3.2 Assign job j to a machine h such that Fh < Sj. 

Update Fh = Cj. 

Various rules could be used in Step 3.2 for the choice of the assigned machine h, 
for example one with largest completion time Fh (so as to reduce the idle time 
between Fh and Sj). Note also that the above algorithm can be modified to 
allow insertion of a job in an idle period before Fh on a machine h. In effect, the 
observations below also apply to all these variants. 

One method (e.g., Phillips et al. [29] as well as Hall et al. [16]) for defining the 
list L consists in sorting the jobs in nondecreasing order of their completion times 
in a relaxation of the scheduling problem under consideration. In the presence of 
ordinary precedence constraints, this works well for the case of a single machine 
(Hall et ah, ibid., see also [34]), but Example 2 shows that this may produce very 
poor schedules for the case of identical parallel machines. This example uses the 
list which is produced by an optimal schedule, the tightest kind of relaxation 
that can be defined; note that this optimal schedule defines the same completion 
time order as the relaxation in Hall et al. and its extension in Section 4 below. 

Example 2. For a fixed number to > 2 of identical parallel machines and a 
positive number e, let the job set be N = {1, . . . , n} with n = m(jn -I- 1) -I- 1. 




Approximation Bounds for Parallel Machine Scheduling 373 



The ordinary precedence constraints (j, k) (with djk = 0) are defined as follows: 
(i) j = 1 + h{m + 1) and fc = j + g, for all /i = 0, . . . , m — 1 and all g = 1, . . . ,m; 
and (ii) for all j < n and k = n. The processing times are pj = 1 + h(m + l)e for 
j = 1 + h{m + 1) and h = 0, . . . ,m — 1; and pj = e otherwise. The objective is 
either to minimize the makespan, or a weighted sum WjCj of job completion 
times with weights Wj = 0 for all j < n and = 1; note that, due to the 
precedence constraints (ii) above, these two objectives coincide for any feasible 
schedule. 

An optimal solution has, for ft, = 0, . . . , m — 1, job j = 1 + h{m + 1) starting 
at time S'* = 0 on machine ft + 1, immediately followed by jobs j + 1, . . . , j + m 
assigned as uniformly as possible to machines 1, . . . , ft+1. Job n is then processed 
last on machine m, so that the optimal objective value is = C* = 1 + {rn^ + 
l)e. A corresponding list is L = (1, 2, . . . , n). Any version of the list scheduling 
algorithm described above produces the following schedule from this list: job 1 
is scheduled with start time 5^ = 0 and completion time C'f=l; the m jobs k = 
2, . . . , TO + 1 are then scheduled, each with = 1 and = 1 + e on a different 
machine; this will force all subsequent jobs to be scheduled no earlier than time 
1 + e. As aresult, for ft = 0, . . . ,to— 1, job j = l + ft(TO+l) is scheduled with start 
time = ft+ (^(ft — l)ft(TO+ l) + ft)e, followed by jobs fc = j + 1, . . . , j + (to+ 1) 
each with = h + l + (|ft(ft + 1 )(to + 1) + ft)e on a different machine. Finally, 
job n is scheduled last with = to + (|(to — 1)to(to + 1) + m)e and thus the 
objective value is = m + o(e), or arbitrarily close to to times the 

optimal value for e > 0 small enough. □ 

The example shows that list scheduling according to completion times can 
lead to poor schedules on identical parallel machines with job precedence con- 
straints. In contrast, we will present in Section 4 a linear programming relax- 
ation of the general problem with precedence delays and show that job-based 
list scheduling according to job midpoints leads to a bounded performance ratio. 

3 The Performance of Graham’s List Scheduling 

In this section we show that Graham’s (non-idling) List Scheduling Algorithms 
generate feasible schedules with makespan less than twice the optimum for in- 
stances of P|prec. delays dijjC'max- As discussed in the introduction, this result 
and proof extend and unify earlier work. 

Let = {S^ , . . . , S^) denote the vector of start times of a schedule con- 
structed by GLSA, as described in Section 2. Let = max^g + Pi) 

denote the makespan of this schedule. For any pair of dates such that 

0 < t < t' < let T[t, t') denote the total machine idle time during the 

interval (Thus, for example, if all to machines are idle during the whole 

interval, then = m{t' — t).) Let N'^ denote the set of jobs in N that have 

at least one predecessor. 

Lemma 1. Let j be a job in and let i be an immediate predecessor of j with 
largest value + Pi + dij . Then T[Sl^ , S^) < to dij + {m — l)pi- 
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Proof. Let i be an immediate predecessor of j with largest value of 5/^ + Pi + 
dij. Since iL is a GLSA, job j may be processed by any available processor 
from date + Pi + dij on. Therefore + Pi + dij,S^) = 0 and hence 

X[Sj^,Sf) = + Pi + dij). Since job i is processed during the time 

interval + Pi), at most m — 1 machines are idle during this interval, and 

we obtain the requisite inequality. □ 

The precedence network {N^,A^,£) is defined by fVo = -^ U {0}; Aq = A U 
{{i, 0) : i G N}] and arc lengths i{i,j) = Pi + dij for all (i,j) € A, and £{i, 0) = pi 
for all i € A^. Job 0 is a dummy job succeeding all other jobs with zero delays. It is 
added so that any maximal path in the precedence network is of the form i ... jO 
and its length is the minimum time required to process all the jobs 
Recall from the introduction that p = max(j djfc/minigArpi. We assume 
that m.a,X(j^k)&A djk + naixii^N pi > 0; otherwise we have ordinary precedence 
constraints and we know that, for the makespan objective (and since we only 
consider nonpreemptive schedules), jobs with zero processing time can then be 
simply eliminated. If some pi = 0 (with all djk > 0) then we may use p = +oo 
and the ratio 1/(1 + p) =0. 

Lemma 2. Let L denote the length of a longest path in the precedence network. 
Then I[0, < (to - 1/(1 + p))L . 

Proof. Let io & N such that +Pio = Starting with q = 0, we construct 

as follows a path (i^, ik-i , . . . ,ig) in the precedence network: 

1. While ig G fV+ do: choose as Zg+i an immediate predecessor of ig with largest 
value 

2. Set k = q and stop. 

Since ik has no predecessor, X[0, S/^) = 0. By repeated application of 
Lemma 1, we have 5,^) < to X)g=o + (^ - 1) E 9 =i ■ In addition, 

, C'max) < (m- l)p*o, and therefore + (to - 

I) Y!1=oPh - Til® lol^l length of the path satisfies di^+i^„ + J2g=oPiq ^ T. 

If mini^N Pi = 0 then we immediately have < mL and the proof is 

complete. Otherwise, < P Pi < pY.q=lPi<,- There- 
fore, X)q=o ^iq+iic, — and the lemma follows. □ 

We are now in the position to prove the main result of this section. 



Theorem 1. For the scheduling problem Pjprec. delays d^jOniax; the perfor- 
mance ratio of Graham’s List Scheduling Algorithm is 2 — 1/(to(1 J- p)), no 
matter which priority list is used. 

Proof We have to = ^i^j^iPi + T[0, and thus, by Lemma 2, 



C, 



H ^ ThieN Pi 

max — _ 

m 




1 



to( 1 J- p) 



L . 



Since T jgPi and L are two lower bounds on the optimum makespan, the 
result follows. □ 
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There exist instances showing that the bound in Theorem 1 is (asymptoti- 
cally) best possible, even for to = 1 [27] . 

4 The Performance of Job-Based List Scheduling 



In this section we first present a linear programming relaxation of the problem 
of minimizing a weighted sum ^ WjCj of job completion times subject to prece- 
dence delays, which is then used in the design of a 4-approximation algorithm 
for this problem. This formulation is a direct extension of a formulation given in 
[16], see also [33]. The decision variables are the job completion times Cj for all 
jobs j G N. Note that this formulation does not take into account the assignment 
of jobs to machines. The set of constraints is: 





Ci -\- dij 


+ Pj 


all {i,j) G A, 


(1) 








all FCN. 


(2) 


ieF 




ieF 







Constraints (1) are the precedence delay constraints. Constraints (2) are a rela- 
tively weak way of expressing the requirement that each machine can process at 
most one job at a time; for the single-machine case (to = 1) they were introduced 
by Wolsey [38] and Queyranne [30], and studied by Queyranne and Wang [31], 
von Arnim, Schrader, and Wang [1], von Arnim and Schulz [2], and Schulz [33] 
in the presence of ordinary precedence constraints; they were extended to to > 2 
parallel machines by Hall et al. [16]. Note that these constraints, for F = {j}, 
imply Cj > 0; these and, as indicated above, the use of a dummy job 0 allow 
the formulation of release date constraints. 

For a weighted sum of completion times objective, the LP formulation is 
simply: 



minimize ^ WjCj subject to (1) - (2). (3) 

jGN 

Let denote any feasible solution to the constraint set (l)-(2) of this LP; 
we will call the LP completion time of job j. We now use this LP solution 
to define a feasible schedule with completion time vector and analyze the 
job-by-job relationship between and for every job j G N. 

We define the LP start time and LP midpoint as —pj 

and Mj"^ = Cj^ —pjj2, respectively. We now use the List Scheduling Algo- 
rithm of Section 2 with the LP midpoint list L defined by sorting the jobs in 
nondecreasing order of their midpoints . The next theorem contains our 
main result. 
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Theorem 2. Let denote any feasible solution to the constraint set (l)-(2) 

and let and denote the associated LP midpoints and start times, re- 

spectively. Let be the vector of start times of the feasible schedule constructed 
by the List Scheduling Algorithm using the LP midpoint list. Then 

Sf < 2 + 2 for all jobs j G N. (4) 

Proof. Assume for simplicity that the jobs are indexed in the order of their LP 
midpoints, that is, < • • • < ■ We fix job j G N and consider 

the schedule constructed by the List Scheduling heuristic using the LP midpoint 
list L = (1, 2, . . . , n) up to and including the scheduling of job j, that is, up to 
the completion of Step 3 with k = j. Let [j] = {1, . . . , j}. 

Let pL denote the total time between 0 and start time of job j when all m 
machines are busy at this stage of the algorithm. Since only jobs in [j — 1] have 
been scheduled so far, we have ^ Sti P*- ^ ~ T- To prove (4), 

we only need to show that 

(i) — and (ii) A < 2.5^^ . 

^ i=l 

Inequality (i) follows from a straightforward variant of Lemma 3.2 in [16]. We 
omit the details here and refer the reader to the full version of this paper, see 
[27]. To show (ii), let q denote the number of time intervals between dates 0 
and when at least one machine is idle (i.e., not processing a job in [j — 1]) 
in the schedule . Denote these idle intervals as {ah,bh) for h= l,...,q, so 
that 0 < oi; that bh-i < an < bh for all h = 2,. . . ,q; and that bq < Sf. We 
have A = — clh) and all machines are busy during the complementary 

intervals \bh,ah+i], including intervals [0,oi] and \bq,S^] if nonempty. 

Consider the digraph = ([j], where 

= {{k,i) G A-. k,iG [j] and = Cf + du -Gpi} , 

that is, is the set of precedence pairs in [j] for which the precedence delay 
constraints (1) are tight for . If 6, > 0 then a machine becomes busy at date bq 
(or starts processing job j if bq = and thus there exists a job x{q) G [j] with 
start time >5'^^) = bq. We repeat the following process for decreasing values 
of the interval index h, starting with h = q, until we reach the date 0 or the 
busy interval [0, oi]. Let (u(l), . . . , u(s)) denote a maximal path in Gl-'l with 
last node (job) v(s) = x{h). Note that we must have bg < < %+! for 

some busy interval \bg,ag+i] with Og+i < bh, for otherwise some machine is 
idle immediately before the start time of job u(l) and this job, not being 
constrained by any tight precedence delay constraint, should have started earlier 
than that date. This implies in particular that s > 2. We have 

s-l s-1 

bh - cig+i < ^ ^ {Pv(i) + i^«(i)«(i+i)) -(5) 

i=l i=l 
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On the other hand, the precedence delay constraints (1) imply 

^v^+l) > + -jPvii) + + -jPv(i+l) 

for alH = 1, . . . , s — 1. Therefore 

1 1 

^x(h) ~ ^v{l) — l^Px(h) T 2 ^ / {Pv(i) + 

^ i=l 

and thus 

S^fn) - > ^(6/. - a.+i) ■ 

If bg > 0 , then let x{g) denote a job with start time satisfying bg < < 

Og+i and with minimum value of under this condition. Therefore ^ 

^v(i)- have {k,x{g)) G for some k £ [j] with < bg < for 

otherwise job x{g) should have started processing on some idle machine before 
date bg. This implies 



Sx(h) - M^{g) ag+i) ( 6 ) 

and we may repeat the above process with h = g and job x{h) = x{g). Since 
g < h ai each step, this whole process must terminate, generating a decreasing 
sequence of indices q = h{l) > ... > h{q') = 0 such that every idle interval 
is contained in some interval [ah(i+i)+i,bh(i)]. Adding the inequalities (6) and 
using for all j = h{i), we obtain 

A < < 2{Sf - 0) . (7) 

i=l 

This establishes (ii). The proof of Theorem 2 is complete. □ 

There exist instances showing that the factors 2 in inequality (4) are (asymp- 
totically) best possible for any number of machines, see [27]. 

Using for an optimal LP solution. Theorem 2 implies performance ratios 
of 1/4 and 4 for the LP relaxation and the heuristic solution, respectively, for 
the ^WjCj objective. 

Corollary 1. Let denote an optimal solution to the LP defined in (3) for 
the problem Pjprec. delays dif ^wjCj. Let denote the solution constructed 
from by the List Scheduling Algorithm using the LP midpoint list, and let 
C* denote an optimum schedule. Then 

WjCY H H 

j&N j&N j&N j&N 
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Examples show that the latter bound is (asymptotically) tight for an arbi- 
trary number of machines. We suspect that the first inequality in (8), bounding 
the performance ratio of the LP relaxation, is not tight. The worst instances we 
know have a performance ratio of 1 /3 for the LP relaxation, see [27] for details. 

The analysis in Theorem 2 may be refined for some special cases, yielding 
tighter performance ratios. For the problem Pjprecj ^ observe that the 

list scheduling algorithm will not allow all machines to be simultaneously idle 
at any date before the start time of any job i € N. Therefore, in the proof of 
Theorem 2, all the idle intervals, with total length A, contain some processing 
of some job(s) i < j; as a result the total work during the busy intervals is at 
most Pi ~ Hence, we obtain the following result. 

Corollary 2. List scheduling by LP midpoints is a {A — 2 /m) -approximation 
algorithm for Pjprecj ^ WjCj . 

Note that for m = 1 we recover the performance ratio of 2 for Ijprecj ^ WjCj 
in [16,34], which is known to be tight for that special case. 

In the case of a single machine, the idle intervals that add up to A time units 
cannot contain any processing. Therefore, in the proof of Theorem 2 replace 
inequality (5) with bh - Og+i < • Adding up the 

precedence delay constraints for alH = 1, . . . , s — 1 and omitting some processing 
times yield dv{^)v{^+l) and thus > 

bh - ag+i. Therefore we may replace (7) with A < ~ a?i(i+i)+i) < 

~ ^x{h(q')) — inequality (ii) with A < . This implies 

Sf < 3M/^. 

Corollary 3. List scheduling by LP midpoints is a 3 -approximation algorithm 
for 1 jprec. delays dij \ ^ WjCj . 

Note that for the special case l|rj,prec| ^WjCj we recover the performance 
ratio of 3 in [16,34]. The best known approximation algorithm for this problem, 
however, has a performance guarantee of e « 2.719 [35]. 

5 Concluding Remarks 

The appropriate introduction of idle time is a rather important part in the design 
of approximation algorithms to minimize the weighted sum of completion times 
subject to precedence delays. As Example 1 illustrates, idle time is needed to 
avoid that profitable jobs which become available soon are delayed by other, less 
important jobs. On the other hand, too much idle time is undesired as well. The 
necessity to balance these two effects contributes to the difficulty of this problem. 
Interestingly, all former approximation algorithms for P\rj,prec\'^WjCj with 
constant-factor performance ratios are based on variants of Graham’s original 
list scheduling which actually tries to avoid machine idle time. In fact. Hall 
et al. [16] partition jobs into groups that are individually scheduled according 
to a GLSA, and then these schedules are concatenated to obtain a solution of 
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the original problem. To find a good partition, this scheme was enriched with 
randomness by Chakrabarti et al. [7]. Chekuri et al. [8] presented a different 
variant of a GLSA by artificially introducing idle time whenever it seems that 
a further delay of the next available job in the list (if it is not the first) can be 
afforded. Hence, the algorithm analyzed in Section 4 seems the first within this 
context that does not take the machine-based point of view of GLSAs. 

However, an advantage of the rather simple scheme of Ghekuri et al. is its 
small running time (though the performance ratios obtained are considerably 
worse). In fact, so far we have not even explained that the algorithms presented 
above are indeed polynomial-time algorithms. Whereas this is obvious for the 
GLSA variant for makespan minimization, we have to argue that in case of the 
total weighted completion time the linear programming relaxation (3) behaves 
well. In fact, it can be solved in polynomial time since the corresponding sepa- 
ration problem is polynomially solvable [33] . 

It seems worth to note that Theorem 2 actually implies stronger results than 
stated in Gorollaries 1 and 3. The performance ratios given there not only hold 
for the weighted sum of completion times, but even for the weighted sum of 
third-points and midpoints, respectively. Recall that, in a given schedule and for 
a given value 0 < a < 1, the a-point of job j is the earliest time at which a 
fraction a of its processing has been performed; thus if the schedule is (or may be 
considered as) nonpreemptive, then the a-point of job j is simply Cj — {l — a)pj. 
Talking about a-points, it is interesting to note that in the proof of Theorem 2 
other a~points could be used in this analysis, provided that \ < a <1, but the 
midpoint (a = 1/2) leads to the best bound. 

Finally, let us relate precedence delays to another kind of restrictions that has 
been given quite some attention recently. Assume that precedence constraints 
(t,j) are caused by some technological requirements. Then it seems reasonable 
that the information or object produced during the processing of job i needs to 
be transferred to job j before the processing of j can start. Hence, precedence 
delays might be interpreted as kind of communication delays. This term is coined 
with a slightly different meaning, however (see, e.g., [32,19,9,25,26]). In fact, in 
this context one usually assumes that a delay only occurs if i and j are assigned 
to different machines. Our results can be extended to the model with precedence 
and communication delays. For details, we again refer the reader to the full 
version of this paper [27]. 



Acknowledgments 

This work was initiated during a workshop on “Parallel Scheduling” held at 
SchloB Dagstuhl, Germany, July 14-18, 1997. The authors are grateful to the 
organizers for providing a stimulating atmosphere. The research of the second 
author is supported in part by a research grant from NSERG, (the Natural Sci- 
ences and Research Gouncil of Ganada) and by the UNI. TU. RIM. S.p.a. (Societa 
per I’Universita nel riminese), whose support is gratefully acknowledged. 




380 



Alix Munier et al. 



References 

1. A. von Arnim, R. Schrader, and Y. Wang. The permutahedron of N-sparse posets. 
Mathematical Programming, 75:1-18, 1996. 

2. A. von Arnim and A. S. Schulz. Facets of the generalized permutahedron of a 
poset. Discrete Applied Mathematics, 72:179-192, 1997. 

3. E. Balas, J. K. Lenstra, and A. Vazacopoulos. The one machine problem with 
delayed precedence constraints and its use in job-shop scheduling. Management 
Science, 41:94-109, 1995. 

4. M. Bartusch, R. H. Mohring, and F. J. Radermacher. Scheduling project net- 
works with resource constraints and time windows. Annals of Operations Research, 
16:201-240, 1988. 

5. D. Bernstein and I. Gertner. Scheduling expressions on a pipelined processor with 
a maximum delay of one cycle. ACM Transactions on Programming Languages 
and Systems, 11:57 - 66, 1989. 

6. J. Bruno, J. W. Jones, and K. So. Deterministic scheduling with pipelined proces- 
sors. IEEE Transactions on Computers, 0-29:308-316, 1980. 

7. S. Chakrabarti, C. A. Phillips, A. S. Schulz, D. B. Shmoys, C. Stein, and J. Wein. 
Improved scheduling algorithms for minsum criteria. In F. Meyer auf der Heide 
and B. Monien, editors. Automata, Languages and Programming, LNCS, Vol. 1099, 
pages 646-657. Springer, Berlin, 1996. 

8. C. S. Chekuri, R. Motwani, B. Natarajan, and C. Stein. Approximation techniques 
for average completion time scheduling. In Proceedings of the 8th ACM-SIAM 
Symposium on Discrete Algorithms, pages 609-618, 1997. 

9. P. Chretienne and C. Picouleau. Scheduling with communication delays: A survey. 
In P. Chretienne, E. G. Goffman Jr., J. K. Lenstra, and Z. Liu, editors, Scheduling 
Theory and its Applications, chapter 4, pages 65-90. John Wiley & Sons, 1995. 

10. F. A. Chudak and D. B. Shmoys. Approximation algorithms for precedence- 
constrained scheduling problems on parallel machines that run at different speeds. 
In Proceedings of the 8th ACM-SIAM Symposium on Discrete Algorithms, pages 
581-590, 1997. 

11. M. R. Garey and D. S. Johnson. Computers and Intractability: A Guide to the 
Theory of NP-Completeness. Freeman, San Francisco, 1979. 

12. M. X. Goemans. Improved approximation algorithms for scheduling with release 
dates. In Proceedings of the 8th ACM-SIAM Symposium on Discrete Algorithms, 
pages 591-598, 1997. 

13. M. X. Goemans, M. Queyranne, A. S. Schulz, M. Skutella, and Y. Wang. Single 
machine scheduling with release dates. Working paper, 1997. 

14. R. L. Graham. Bounds for certain multiprocessing anomalies. Bell System Tech. 
J., 45:1563-1581, 1966. 

15. R. L. Graham, E. L. Lawler, J. K. Lenstra, and A. H. G. Rinnooy Kan. Opti- 
mization and approximation in deterministic sequencing and scheduling: A survey. 
Annals of Discrete Mathematics, 5:287-326, 1979. 

16. L. A. Hall, A. S. Schulz, D. B. Shmoys, and J. Wein. Scheduling to minimize average 
completion time: Off-line and on-line approximation algorithms. Mathematics of 
Operations Research, 22:513-544, 1997. 

17. W. Herroelen and E. Demeulemeester. Recent advances in branch-and-bound pro- 
cedures for resource-constrained project scheduling problems. In P. Ghretienne, 
E. G. Coffman Jr. , J. K. Lenstra, and Z. Liu, editors. Scheduling Theory and its 
Applications, chapter 12, pages 259-276. John Wiley & Sons, 1995. 




Approximation Bounds for Parallel Machine Scheduling 381 



18. J. A. Hoogeveen, P. Schuurman, and G. J. Woeginger. Non-approximability results 
for scheduling problems with minsum criteria. In R. E. Bixby, E. A. Boyd, and 
R. Z. Ri'os-Mercado, editors, Proceedings of the 6th International IPCO Conference, 
LNCS, Vol. 1412 , pages 344-357. Springer, 1998. This volume. 

19. J. A. Hoogeveen, B. Veltman, and J. K. Lenstra. Three, four, five, six, or the 
complexity of scheduling with communication delays. Operations Researeh Letters, 
3:129-137, 1994. 

20. J. M. Jaffe. Efficient scheduling of tasks without full use of processor resources. 
Theoretical Computer Science, 12:1-17, 1980. 

21. E. Lawler, J. K. Lenstra, C. Martel, B. Simons, and L. Stockmeyer. Pipeline 
scheduling: A survey. Technical Report RJ 5738 (57717), IBM Research Division, 
San Jose, California, 1987. 

22. E. L. Lawler, J. K. Lenstra, A. H. G. Rinnooy Kan, and D. B. Shmoys. Sequencing 
and scheduling: Algorithms and complexity. In S. C. Graves, A. H. G. Rinnooy Kan, 
and P. H. Zipkin, editors. Logistics of Production and Inventory, Handbooks in 
Operations Research and Management Science, Vol. 4, chapter 9, pages 445-522. 
North-Holland, Amsterdam, The Netherlands, 1993. 

23. J. K. Lenstra and A. H. G. Rinnooy Kan. Complexity of scheduling under prece- 
dence constraints. Operations Research, 26:22-35, 1978. 

24. J. Y.-T. Leung, O. Vornberger, and J. Witthoff. On some variants of the bandwidth 
minimization problem. SIAM J. Computing, 13:650-667, 1984. 

25. R. H. Mohring, M. W. Schaffter, and A. S. Schulz. Scheduling jobs with com- 
munication delays: Using infeasible solutions for approximation. In J. Diaz and 
M. Serna, editors, Algorithms - ESA’96, LNCS, Vol. 1136, pages 76-90. Springer, 
Berlin, 1996. 

26. A. Munier and J.-C. Konig. A heuristic for a scheduling problem with communi- 
cation delays. Operations Researeh, 45:145-147, 1997. 

27. A. Munier, M. Queyranne, and A. S. Schulz. Approximation bounds for a 
general class of precedence constrained parallel machine scheduling problems. 
Preprint 584/1998, Department of Mathematics, Technical University of Berlin, 
Berlin, Germany, 1998. ftp://ftp.math.tu-berlin.de/pub/Preprints/combi/ 
Report-584-1998.ps . Z 

28. K. W. Palem and B. Simons. Scheduling time critical instructions on RISG ma- 
chines. In Proceedings of the 17th Annual Symposium on Principles of Programming 
Languages, pages 270-280, 1990. 

29. C. Phillips, C. Stein, and J. Wein. Scheduling jobs that arrive over time. In 
Proceedings of the Fourth Workshop on Algorithms and Data Structures, LNCS, 
Vol. 955, pages 86-97. Springer, Berlin, 1995. 

30. M. Queyranne. Structure of a simple scheduling polyhedron. Mathematical Pro- 
gramming, 58:263-285, 1993. 

31. M. Queyranne and Y. Wang. Single-machine scheduling polyhedra with precedence 
constraints. Mathematics of Operations Research, 16:1-20, 1991. 

32. V. J. Rayward-Smith. UET scheduling with unit interprocessor communication 
delays. Discrete Applied Mathematics, 18:55-71, 1987. 

33. A. S. Schulz. Polytopes and Scheduling. PhD thesis, Technical University of Berlin, 
Berlin, Germany, 1996. 

34. A. S. Schulz. Scheduling to minimize total weighted completion time: Performance 
guarantees of LP-based heuristics and lower bounds. In W. H. Cunningham, S. T. 
McCormick, and M. Queyranne, editors. Integer Programming and Combinatorial 
Optimization, LNCS, Vol. 1084, pages 301-315. Springer, Berlin, 1996. 




382 



Alix Munier et al. 



35. A. S. Schulz and M. Skutella. Random-based scheduling: New approximations and 
LP lower bounds. In J. Rohm, editor, Randomization and Approximation Tech- 
niques in Computer Science, LNCS, Vol. 1269, pages 119-133. Springer, Berlin, 
1997. 

36. A. S. Schulz and M. Skutella. Scheduling-LPs bear probabilities: Randomized 
approximations for min-sum criteria. In R. Burkard and G. Woeginger, editors. 
Algorithms - ESA’97, LNCS, Vol. 1284, pages 416-429. Springer, Berlin, 1997. 

37. E. D. Wikum, D. C. Llewellyn, and G. L. Nemhauser. One-machine generalized 
precedence constrained scheduling problems. Operations Research Letters, 16:87- 
89, 1994. 

38. L. A. Wolsey, August 1985. Invited Talk at the 12th International Symposium on 
Mathematical Programming, MIT, Gambridge. 




An Efficient Approximation Algorithm for 
Minimizing Makespan on Uniformly Related 

Machines 



Chandra Chekuri^* and Michael Bender^ 

^ Computer Science Department 
Stanford University 
chekuri@@cs . stanford.edu 
^ Division of Applied Sciences 
Harvard University 
bender@@deas . harvard . edu 



Abstract. We give a new efficient approximation algorithm for schedul- 
ing precedence constrained jobs on machines with different speeds. The 
setting is as follows. There are n jobs 1, . . . , n where job j requires pj 
units of processing. The jobs are to be scheduled on a set of m machines. 
Machine i has a speed Si; it takes Pj/si units of time for machine i to pro- 
cess job j. The precedence constraints on the jobs are given in the form of 
a partial order. If j -< k, processing of k cannot start until j’s execution if 
finished. Let Cj denote the completion time of job j. The objective is to 
find a schedule to minimize Cmax = maxj Cj , conventionally called the 
makespan of the schedule. We consider non-preemptive schedules where 
each job is processed on a single machine with no preemptions. Recently 
Chudak and Shmoys [1] gave an algorithm with an approximation ra- 
tio of O(logm) significantly improving the earlier ratio of 0{^/m) due 
to Jaffe [7]. Their algorithm is based on solving a linear programming 
relaxation of the problem. Building on some of their ideas, we present 
a combinatorial algorithm that achieves a similar approximation ratio 
but runs in 0{n^) time. In the process we also obtain a constant factor 
approximation algorithm for the special case of precedence constraints 
induced by a collection of chains. Our algorithm is based on a new lower 
bound which we believe is of independent interest. By a general result 
of Shmoys, Wein, and Williamson [10] our algorithm can be extended to 
obtain an O(logm) approximation ratio even if jobs have release dates. 
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1 Introduction 

The problem of scheduling precedence constrained jobs on a set of identical 
parallel machines to minimize makespan is one of the oldest problems for which 
approximation algorithms have been devised. Graham [4] showed that a simple 
list scheduling gives a ratio of 2 and it is the best known algorithm till date. 
We consider a generalization of this model in which machines have different 
speeds. In the scheduling literature such machines are called uniformly related. 
We formalize the problem below. There are n jobs 1, . . . , n, with job j requiring 
processing of pj units. The jobs are to be scheduled on a set of m machines. 
Machine i has a speed factor Si. Job j with a processing requirement pj takes 
Pj / Si time units to run on machine i. Let Cj denote the completion time of 
job j. The objective is to find a schedule to minimize Cmax = maxj Cj called 
the makespan of the schedule. We restrict ourselves to non-preemptive schedules 
where a job once started on a machine has to run to completion on the same 
machine. Our results carry over to the preemptive case as well. In the scheduling 
literature [5] where problems are classified in the a|/3|7 notation, this problem 
is referred to as Q\prec\Cmax- 



Liu and Liu [9] analyzed the performance of Graham’s list scheduling algo- 
rithm for the case of different speeds and showed that the approximation guar- 
antee depends on the ratio of the largest to the smallest speed. This ratio could 
be arbitrarily large even for a small number of machines. The first algorithm to 
have a bound independent of the speeds was given by Jaffe [7]. He showed that 
list scheduling restricted to the set of machines with speeds that are within a 
factor of l/>/m of the fastest machine speed results in an 0{^/m) bound. More 
recently, Ghudak and Shmoys [1] improved the ratio considerably and gave an 
algorithm which has a guarantee of O(logm). At a more basic level their algo- 
rithm has a guarantee of 0{K) where K is the number of distinct speeds. The 
above mentioned algorithm relies on solving a linear programming relaxation 
and uses the information obtained from the solution to allocate jobs to proces- 
sors. We present a new algorithm which finds an allocation without solving a 
linear program. The ratio guaranteed by our algorithm is also 0(log m) but is 
advantageous for the following reason. Our algorithm runs in 0{n^) time and 
is combinatorial, hence is more efficient than the algorithm in [1]. Further, the 
analysis of our algorithm relies on a new lower bound which is very natural, 
and might be useful in other contexts. In addition we show that our algorithm 
achieves a constant factor approximation when the precedence constraints are 
induced by a collection of chains. We remark here that our work was inspired 
by, and builds upon the ideas in [1]. 



The rest of the paper is organized as follows. Section 2 contains some of 
the ideas from the paper of Ghudak and Shmoys [1] that are useful to us. We 
present our lower bound in Section 3, and give the approximation algorithm and 
the analysis in Section 4. 
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2 Preliminaries 



We summarize below the basic ideas in the work of Chudak and Shmoys [1]. 
Their main result is an algorithm which gives a ratio of 0{K) for the problem 
of <5|prec|Cniax where K is the number of distinct speeds. They also show how 
to reduce the general case with arbitrary speeds to one in which there are only 
O(logm) distinct speeds as follows. 

— Ignore all machines with speed less than 1/m times the speed of the fastest 
machine. 

— Round down all speeds to the nearest power of 2. 



They observe that the above transformation can be done while losing only a con- 
stant factor in the approximation ratio. Using this observation, we will restrict 
ourselves to the case where we have K distinct speeds. 

When all machines have the same speed {K = 1), Graham [4] showed that 
list scheduling gives a 2 approximation. His analysis shows that in any schedule 
produced by list scheduling, we can identify a chain of jobs ji A j 2 . . . A jr such 
that a machine is idle only when one of the jobs in the above chain is being 
processed. The time spent processing the above chain is a lower bound on the 
optimal makespan. In addition, the measure of the time instants during which 
all machines are busy is also a lower bound by arguments about the average 
load. These two bounds provide an upper bound of 2 on the approximation ratio 
of list scheduling. One can apply a similar analysis for the multiple speed case. 
As observed in [1], the difficulty is that the time spent in processing the chain 
identified from the list scheduling analysis, is not a lower bound. The only claim 
that can be made is that the processing time of any chain on the fastest machine 
is a lower bound. However the jobs in the chain guaranteed by the list scheduling 
analysis do not necessarily run on the fastest machine. Based on this observation, 
the algorithm in [1] tries to find an assignment of jobs to speeds (machines) that 
ensures that the processing time of any chain is bounded by some factor of the 
optimal. 

We will follow the notation of [1] for sake of continuity and convenience. 
Recall that we have K distinct speeds. Let mk be the number of machines 
with speed Sk, k = 1, . . . ,RT, where si > ... > sk- Let denote the sum 
J2k=u sequel we will be interested in assigning jobs to speeds. For 

a given assignment, let k{j) denote the speed at which job j is assigned to be 
processed. The average processing allocated to a machine of a specific speed k, 
denoted by D^, is the following. 



Dk 



1 

nikSk 



Pj- 

j-.k{j)=k 



A chain is simply a subset of jobs which are totally ordered by the precedence 
constraints. Let V be the set of all chains induced by the precedence constraints. 
We compute a quantity C defined by the following equation. 



C' = maxV 

^k(j) 
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A natural variant of list scheduling called speed based list scheduling is developed 
in [1] which is constrained to schedule according to the speed assignments of the 
jobs. In classical list scheduling, the first available job from the list is scheduled 
as soon as a machine is free. In speed based list scheduling, an available job is 
scheduled on a free machine provided the speed of free machine matches the 
speed assignment of the job. The following theorem is from [1]. The analysis is 
a simple generalization of Graham’s analysis of list scheduling. 

Theorem 1 (Chudak and Shmoys). For any job assignment k{j), j = 1, . . . , 
n, the speed-based list scheduling algorithm produces a schedule of length 

K 

Gniax ^ G + ^ ' Dk- 

k=l 

The authors of [1] use a linear programming relaxation of the problem to 
obtain a job assignment that satisfies the two conditions: Gfc < (A" + 

and C < where is the optimal makespan. Com- 

bining these with Theorem 1 gives them an 0{K) approximation. We will show 
how to use an alternative method based on chain decompositions to obtain an 
assignment satisfying similar properties. 

3 A New Lower Bound 

In this section we develop a simple and natural lower bound that will be used in 
the analysis of our algorithm. Before formally stating the lower bound we provide 
some intuition. The two lower bounds used in Graham’s analysis for identical 
parallel machines are the maximum chain length and the average load. As dis- 
cussed in the previous section, a naive generalization of the first lower bound 
implies that the maximum chain length (a chain’s length is sum of processing 
times of jobs in it) divided by the fastest speed is a lower bound. However it is 
easy to generate examples where the maximum of this bound and the average 
load is 0(1 /m) times the optimal. We describe the general nature of such exam- 
ples to motivate our new bound. Suppose we have two speeds with 'si = D and 
S 2 = 1. The precedence constraints between the jobs are induced by a collection 
of £ > I chains, each of the same length D. Suppose mi = 1, and m 2 = £ • D. 
The average load can be seen to be upper bounded by 1. In addition the time 
to process any chain on the fastest processor is 1. However if I? ^ ^ it is easy to 
observe that the optimal is n{£) since only £ machines can be busy at any time 
instant. The key insight we obtain from the above example is that the amount 
of parallelism in an instance restricts the number of machines that can be used. 
We capture this insight in our lower bound in a simple way. We need a few defi- 
nitions to formalize the intuition. We view the precedence relations between the 
jobs as a weighted poset where each element of the poset has a weight associated 
with it that is the same as the processing time of the associated job. We will 
further assume that we have the transitive closure of the poset. 
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Definition 1. A chain P is a set of jobs ji, ■ ■ ■ ,jr such that for all 1 < i < r, 
ji ~< ji+i- The length of a chain P, denoted by |P|, is the sum of the processing 
times of the jobs in P. 



Definition 2. A chain decomposition V of a set of precedence constrained jobs 
is an partition of the poset in to a collection of chains {P\,P 2 , . . . , Pr}- A maxi- 
mal chain decomposition is one in which Pi is a longest chain and {P 2 , . ■ . , Pr} 
is a maximal chain decomposition of the poset with elements of Pi removed. 



Though we define a maximal chain decomposition as a set of chains, we will 
implicitly assume that it is an ordered set, that is |Pi| > IP2I > ■ • • |Pr|- 



Definition 3. Let V = {Pi, P 2 , . . . , P^} be any maximal chain decomposition of 
the precedence graph of the jobs. We define a quantity called L-p associated with 
V as follows. 



Lp 



max 



ELi 
> ELi 



Si 



With the above definitions in place we are ready to state and prove the new 
lower bound. 



Theorem 2. Let V = {Pi, P 2 , ..., Pr} be any maximal chain decomposition of 
the precedence graph of the jobs. Let AL = s*) which represents 

the average load. Then 

Cmax > max{AL,P-p}. 

Moreover the lower bound is valid for the preemptive case as well. 



Proof. It is easy to observe that > AL. We will show the following for 

1 < j < TO 



^max 



> 






which will prove the theorem. Consider the first j chains. Suppose our input 
instance was modified to have only the jobs in the first j chains. It is easy to see 
that a lower bound for this modified instance is a lower bound for the original 
instance. Since it is possible to execute only one job from each chain at any 
time instant, only the fastest j machines are relevant for this modified instance. 
The expression (X)i=i l-f’i|)/(Si=i i® simply the average load for the modified 
instance, which as we observed before, is a lower bound. Since the average load 
is also a lower bound for the preemptive case, the claimed lower bound applies 
even if preemptions are allowed. 



Horvath, Lam, and Sethi [6] proved that the above lower bound gives the 
optimal schedule length for preemptive scheduling of chains on uniformly related 
machines. The idea of extending their lower bound to general precedences using 
maximal chain decompositions is natural but does not appear to have been 
effectively used before. 
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Theorem 3. A maximal chain decomposition can he computed in 0{n^) time. 
If all pj are the same, the running time can he improved to 0{n^\/n). 

Proof. It is necessary to find the transitive closure of the given graph of prece- 
dence constraints. This can be done in O(n^) time using a BFS from each vertex. 
From a theoretical point of view this can be improved to OfnP) where oj < 2.376 
using fast matrix multiplication [2] . A longest chain in a weighted DAG can be 
found in O(n^) time using standard algorithms. Using this at most n times, a 
maximal chain decomposition can be obtained in O(n^) time. If all pj are the 
same (without loss of generality we can assume they are all 1), the length of a 
chain is the same as the number of vertices in the chain. It is possible to use 
this additional structure to obtain a maximal chain decomposition in 0(n^^/n) 
time. We omit the details. 

4 The Approximation Algorithm 

The approximation algorithm we develop in this section will be based on the 
maximal chain decompositions defined in the previous section. As mentioned 
in Section 2, we will describe an algorithm to produces an assignment of jobs 
to speeds at which they will be processed. Then we use the speed based list 
scheduling of [1] with the job assignment produced by our algorithm. 



1. compute a maximal chain decomposition of the jobs V = {Pi, ..., Pr}. 

2. set £ = 1. set B = max{AL, L-pj. 

3. for each speed 1 < z < 77 do 

(a) let 7 < t < r be maximum index such that l-Pj l/(™i'5i) < 473. 

(b) assign jobs in chains Pt,. . . ,Pt to speed i. 

(c) set £=t+l.li£>r return. 

4. return. 



Fig. 1. Algorithm Chain- Alloc 



The algorithm in Figure 1 computes a lower bound B on the optimal using 
Theorem 2. It then orders the chains in non-increasing lengths and allocates the 
chains to speeds such that no speed is loaded by more than four times the lower 
bound. We now prove several properties of the above described allocation which 
leads to the approximation guarantee of the algorithm. 

Lemma 1. Let Pi(u), . . . ,Pr he the chains remaining when Chain- Alloc consid- 
ers speed u in step 3 of the algorithm. Then 

C |73f!(u) ^ ‘2B and 

2. Either Pi(u), . . . ,Pr are allocated to speed u or 77„ > 273. 
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Proof. We prove the above assertions by induction on u. Consider the base case 
when u = 1 and = 1. From the definition of L-p it follows that |Pi|/si < B. 
Since Pi is the longest chain, it follows that |Pj|/si < P for 1 < j < r. Let t 
be the last chain allocated to si. If t = r we are done. If t < r, it must be the 
case that adding P(t+i) increases the average load on si to more than 4P. Since 
P(t+i)/si < B, we conclude that Di = > 3P > 2B. 

Assume that the conditions of the lemma are satisfied for speeds si to s„_i 
and consider speed s„. We will assume that £{u) < r for otherwise there is 
nothing to prove. We observe that the second condition follows from the first 
using an argument similar to the one used above for the base case. Therefore 
it is sufficient to prove the first condition. Suppose \Pe{u)\/su > 2P. We will 
derive a contradiction later. Let j = £(u) and let v be the index such that 
< j < Mf (recall that Mf = Yl'k=i ^fc)- If j > no such index exists 
and we set v to K, the slowest speed. If j < m, for convenience of notation we 
assume that j = Mf simply by ignoring other machines of speed s„. It is easy 
to see that v > u and j > Mf~^. From the definition of Lp, AL, and B, we 
get the following. If j < to then Lp > (X)i=i ’’’^kSk)- If j > to then 

AL > (X)i=i \Pi\)/^^k=i ‘knkSk)- In either case we obtain the fact that 

'T-’ IP- 1 

- < max{Pp, AL} = B (I) 

Z^k—1 

Since |Pj|/s« > 2B, it must be the case that |Pi|/s„ > 2B for all <i < j- 

This implies that 

i 

^ |P,| >2P(j-M“-')s« 

V 

> 2B E rukSk 

k—u 

j V 

\Pj\ > 2P TOfcSfc (2) 

2=1 k—u 

The last inequality follows since we are summing up more terms on the left hand 
side. From the induction hypothesis it follows that speeds si to s„_i have an 
average load greater than 2B. From this we obtain 

j-l u-l 

^\Pi\ > 2B^mkSk (3) 

i=l k—1 

j u-l 

^ \Pj\ >2By^ rrikSk (4) 

2=1 k—1 
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Combining Equations 2 and 4 we obtain the following. 

j U—1 V 

2 ^ I > 2S ^ rukSk + 2B'^ rrikSk 

i—1 k—1 k—u 

V 



> 2B E ITlkSk 



j V 

^ E I ^ ^ E 

i—1 k—1 

Equation 5 contradicts Equation 1. 

\P\ 

Corollary 1. If chain Pj is assigned to speed i, then < 2B. 
Corollary 2. Algorithm Chain-Alloc alloeates all chains. 



( 5 ) 



Lemma 2. For I < k < K , Dk < 

Proof. Since B < and the algorithm never loads a speed by more than an 

average load of 4B, the bound follows. 



Lemma 3. For the job assignment produced by Chain-Alloc C < 2KC^^^. 

Proof. Let P be any chain. We will show that ^j^pPj/sk{j) < 2KC^g^^ where 
k{j) is the speed to which job j is assigned. Let Ai be the set of jobs in P which 
are assigned to speed i. Let P^ be the longest chain assigned to speed i by the 
algorithm. We claim that |F^| > Pi- This is because the jobs in Ai form a 

chain when we picked Pe to be the longest chain in the max chain decomposition. 
From Corollary 1 we know that \Pi\/si < 2B < 2C^^^. Therefore it follows that 



E 

jeP 



Pi 

Sk{j) 



t- 

i=l 



Si 



< 2KC1 



Theorem 4. Using speed based list scheduling on the job assignment produced 
by Algorithm Chain-Alloc gives a 6K approximation where K is the number of 
distinct speeds. Furthermore the algorithm runs in O(n^) time. The running time 
can be improved to Ofnfy/n) if all pj are the same. 

Proof. From Lemma 2 we have Dk < 4C^^^ for 1 < k < K and from Lemma 3 
we have C < 2KCf^^^^. Putting these two facts together, for the job assignment 
produced by the algorithm Chain-Alloc, speed based list scheduling gives the 
following upper bound by Theorem 1. 

K 

C„,ax <C + Y.^k< 2KC*^,^ + AKC*^,^ < 6KC*^,^. 
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It is easy to see that the speed based list scheduling can be implemented in 
0{n^) time. The running time is dominated by the time to do the maximum 
chain decomposition. Theorem 3 gives the desired bounds. 



Corollary 3. There is an algorithm which runs in O(n^) time and gives an 
O(logm) approximation ratio to the problem of scheduling precedence constrained 
jobs on uniformly related machines. 

We remark here that the leading constant in the LP based algorithm in [1] 
is better. We also observe that the above bound is based on our lower bound 
which is valid for preemptive schedules as well. Hence our approximation ratio 
is also valid for preemptive schedules. In [I] it is shown that the lower bound 
provided by the LP relaxation is a factor of f?(logm/loglogm) away from the 
optimal. Surprisingly it is easy to show using the same example as in [1] that 
our lower bound from Section 3 is also a factor of l7(logm/loglogm) away from 
the optimal. 

Theorem 5. There are instances where the lower bound given in Theorem 2 is 
a factor of f2 (log m/loglogm) away from the optimal. 

Proof. The proof of Theorem 3.3 in [1] provides the instance and it is easily 
verified that any maximum chain decomposition of that instance is a factor of 
l7(logTO/loglogm) away from the optimal. 



4.1 Release Dates 

Now consider the scenario where each job j has a release date rj before which 
it cannot be processed. By a general result of Shmoys, Wein, and Williamson 
[10] an approximation algorithm for the problem without release dates can be 
transformed to one with release dates losing only a factor of 2 in the process. 
Therefore we obtain the following. 

Theorem 6. There is an O(logm) approximation algorithm for the problem 
Q\prec,rj\Craax that runs in time 0(n^). 

4.2 Scheduling Chains 

In this subsection we show that Chain-Alloc followed by speed based list schedul- 
ing gives a constant factor approximation if the precedence constraints are in- 
duced by a collection of chains. We give an informal proof in this version of 
the paper. We first observe that any maximal chain decomposition of a collec- 
tion of chains is simply the collection itself. The crucial observation is that the 
algorithm Chain-Alloc allocates all jobs of any chain to the same speed class. 
The two observation together imply that there are no precedence relations be- 
tween jobs allocated to different speeds. This allows us to obtain a stronger ver- 
sion of Theorem 1 where we can upper bound the makespan obtained by speed 
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based list scheduling as C'm&x < C + T^naxi<k<K Dk- Further we can bound C as 
maxi<i<r \Pi\/sk(i) where chain Pi is allocated to speed k(i). From Corollary 1 
it follows that C < 2B. Lemma 2 implies that maxKk<K Dk < 4_B. Combining 
the above observations yields a 6 approximation. 

Theorem 7. There is a (S approximation for the problem Q\chains\Cniax and a 
12 approximation for the problem Q\chains,rj\Cmax- 

Computing the maximal chain decomposition of a collection of chains is triv- 
ial and the above algorithm can be implemented in 0(n log n) time. 

5 Conclusions 

The main contribution of this paper is a simple and efficient O(logm) approxi- 
mation to the scheduling problem Q\prec\Cmax- Chudak and Shmoys [1] provide 
similar approximations for the more general case when the objective function is 
the average weighted completion time (Q\prec\^WjCj) using linear program- 
ming relaxations. We believe that the techniques of this paper can be extended 
to obtain a simpler and combinatorial algorithm for that case as well. It is known 
that the problem of minimizing makespan is hard to approximate to within a 
factor of 4/3 even if all machines have the same speed [8]. However, for the single 
speed case a 2 approximation is known, while the current ratio for the multi- 
ple speed case is only O(logm). Obtaining a constant factor approximation, or 
improving the hardness are interesting open problems. 
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Abstract. Enumerative approaches, such as branch-and-bound, to solv- 
ing optimization problems require a subroutine that produces a lower 
bound on the value of the optimal solution. In the domain of scheduling 
problems the requisite lower bound has typically been derived from either 
the solution to a linear-programming relaxation of the problem or the 
solution of a combinatorial relaxation. In this paper we investigate, from 
both a theoretical and practical perspective, the relationship between 
several linear-programming based lower bounds and combinatorial lower 
bounds for two scheduling problems in which the goal is to minimize the 
average weighted completion time of the jobs scheduled. 

We establish a number of facts about the relationship between these dif- 
ferent sorts of lower bounds, including the equivalence of certain linear- 
programming-based lower bounds for both of these problems to combi- 
natorial lower bounds used in successful branch-and-bound algorithms. 

As a result we obtain the first worst-case analysis of the quality of the 
lower bound delivered by these combinatorial relaxations. 

We then give an empirical evaluation of the strength of the various lower 
bounds and heuristics. This extends and puts in a broader context a 
recent experimental evaluation by Savelsbergh and the authors of the 
empirical strength of both heuristics and lower bounds based on different 
LP-relaxations of a single- machine scheduling problem. We observe that 
on most kinds of synthetic data used in experimental studies a simple 
heuristic, used in successful combinatorial branch-and-bound algorithms 
for the problem, outperforms on average all of the LP-based heuristics. 
However, we identify other classes of problems on which the LP-based 
heuristics are superior, and report on experiments that give a qualitative 
sense of the range of dominance of each. Finally, we consider the impact 
of local improvement on the solutions. 
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1 Introduction 

A well-studied approach to the exact solution of NP-hard scheduling problems 
may be called enumerative methods, in which (implicitly) every possible solution 
to an instance is considered in an ordered fashion. An example of these methods 
is branch-and-bound, which uses upper and lower bounds on the value of the 
optimal solution to cut down the search space to a (potentially) computationally 
tractable size. Such methods are typically most effective when the subroutines 
used to calculate both the upper and lower bounds are fast and yield strong 
bounds, hence quickly eliminating much of the search space from consideration. 

Although there are a wealth of approaches to designing the lower-bounding 
subroutines, we can identify two that have been particularly prominent. The 
first relies on a linear-programming relaxation of the problem, which itself is 
often derived from an integer linear-programming formulation by relaxing the 
integrality constraints; Queyranne and Schulz give an extensive survey of this 
approach [15]. The second relies on what we will call a combinatorial relaxation of 
the problem and yields what we will call a combinatorial lower bound. By this we 
simply mean that the lower bound is produced by exploiting some understanding 
of the structure of the problem as opposed to by solving a mathematical program. 
For example, in this paper we focus on combinatorial lower bounds that are 
obtained by relaxing the constraint (in a nonpreemptive scheduling problem) 
that the entire job must be processed in an uninterrupted fashion. 

Another approach to an NP-hard scheduling problem is to develop an ap- 
proximation algorithm. Here the goal is to design an algorithm that runs in 
polynomial time and produces a near-optimal solution of some guaranteed qual- 
ity. Specifically, we define a p-approximation algorithm to be an algorithm that 
runs in polynomial time and delivers a solution of value at most p times optimal; 
see [9] for a survey. In contrast, an enumerative approach attempts to solve a 
(usually small) problem to optimality, with no guarantee that the solution will 
be obtained in time polynomial in the size of the input. 

Recently various researchers have been successful in creating new connec- 
tions between linear-programming relaxations used to give lower bounds for 
certain scheduling problems and the design of approximation algorithms. Specif- 
ically, they have used these relaxations to develop approximation algorithms 
with small-constant-factor worst-case performance guarantees; as a by-product 
one obtains worst-case bounds on the quality of the lower bound delivered by 
these relaxations [14,10,7,18,17]. We define a p-relaxation of a problem to be a 
relaxation that yields a lower bound that is always within a factor of p of the 
optimal solution. 

In this paper we establish additional connections between different approaches 
to these problems. We consider two A^P-hard scheduling problems in which the 
goal is to minimize the average weighted completion time of the jobs scheduled: 
l\rj \ ^WjCj, the problem of scheduling n jobs with release dates on a single 
machine, and P]] X) ^^e problem of scheduling n jobs on identical parallel 

processors. For each problem we show that a combinatorial lower bound that 
was used successfully in a branch-and-bound code for the problem is equivalent 
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to the solution of a linear-programming relaxation that had been used in the 
design of approximation algorithms. As a consequence we give the first worst- 
case analysis of these sorts of combinatorial lower bounds. We also consider 
several related lower bounds and establish a number of facts about their relative 
strength. Finally, for l|rj| ^ WjCj, we give an empirical evaluation of the rela- 
tive performance of these different lower bounds, and compare the performance 
of the approximation algorithms based on the LP-relaxations with the heuristics 
used in the successful branch-and-bound code of Belouadah, Posner and Potts 
[ 2 ]. 

Brief Discussion of Previous Related Work: We begin with l|rj| ^WjCj. 
Dyer and Wolsey considered several linear-programming relaxations of this prob- 
lem as a tool for producing strong lower bounds [5] . Among those considered were 
two time-indexed linear programming relaxations, in which the linear program 
contains a variable for every job at every point in time. In the first relaxation 
{0, l}-variables yjt determine whether job j is processed during time t, whereas 
in a second stronger relaxation {0, l}-variables Xjt determine whether job j com- 
pletes at time t. 

Although both linear programs are of exponential size. Dyer and Wolsey 
showed that the y^t-LP is a transportation problem with a very special structure 
and thus can be solved in 0(n log n) time. The Xjt-ljP, which has been observed 
empirically to give strong lower bounds [19,22], is very difficult to solve due 
to its size. Van den Akker, Hurkens and Savelsbergh [21] developed a column- 
generation approach to solving these linear programs that made feasible the 
solution of instances with up to 50 jobs with processing times in the range of 
[0..30]. 

Inspired by the empirical strength of this relaxation. Hall, Schulz, Shmoys 
and Wein [10] gave a 3-approximation algorithm for \\xj \ ^ WjCj based on time- 
indexed linear programs. Their approximation algorithm in fact relies only on 
the weaker yj^-relaxation and simultaneously proves that the y^t-LP (and hence 
the stronger Xjt-LP) are 3- relaxations of the problem. Subsequent papers gave 
improved techniques with better constant performance guarantees [7,8]. In a re- 
cent empirical study, Savelsbergh, Uma and Wein [16] demonstrated that the 
yjt bound on many instances comes within a few percent of the Xjt bound at 
a greatly reduced computational cost. They also showed that the ideas that led 
to improved approximation algorithms also yielded improved empirical perfor- 
mance; the best heuristics gave rather good results on most sorts of randomly 
generated synthetic data. 

In parallel with work on linear-programming lower bounds for l\rj \ ’^WjCj 
there has been significant work on branch-and-bound algorithms for V\Vj \ ^ WjCj 
based on combinatorial lower bounds [2,3,4,12]. The most successful of these is 
due to Belouadah, Posner and Potts [2] who made use of two combinatorial lower 
bounds based on job splitting, and an upper bound based on a simple greedy 
heuristic. 

Although it is difficult to compare the efficacy of the branch-and-bound code 
of Belouadah, Posner and Potts with the branch-and-cut code due to Van den 
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Akker et. al. based on Xj(-relaxations [20] (since they were developed several 
years apart in different programming languages on different architectures, etc.) 
the evidence seems to be that neither much dominates the other; however, that 
of Belouadah, Posner and Potts seems to have been somewhat stronger, as they 
were able to solve to optimality problems of size 50 whereas van den Akker et. 
al solved to optimality problems of size 30. The enhanced strength of the lower 
bounds due to the x^t-relaxations does not make up for the amount of time it 
takes to solve them. 

Discussion of Results: This paper was born out of an interest to make more 
precise, from both an analytical and empirical perspective, the comparison be- 
tween the LP-based techniques and the techniques associated with the best com- 
binatorial branch-and-bound algorithm. In this process several potentially sur- 
prising relationships between these two approaches arose. Specifically, we show 
that the solution delivered by the y^t-based relaxation for Ijr^ | ^ '^jCj is identi- 
cal to that used to deliver the weaker of the two lower bounds used by Belouadah, 
Posner and Potts. We also show that the stronger of their two lower bounds, while 
empirically usually weaker than the x^t-based relaxation, neither always domi- 
nates that lower bound nor is it dominated by it. A corollary of this observation 
is that the optimal preemptive schedule for an instance of Ijrjj '^WjCj neither 
always dominates nor is dominated by the solution to the x^t-relaxation. 

We then establish a similar relationship for a different problem. Webster 
[23] gave a series of lower bounds for P|| '^WjCj that are based on a similar 
notion to the job-splitting approach of Belouadah, Posner and Potts. We show 
that the weakest of his lower bounds (which in fact was originally proposed by 
Eastman, Even and Isaacs in 1964 [6] ) is equivalent to a generalization of the 
yjt relaxation to parallel machines that was also used to give approximation 
algorithms for P\rj \ by Schulz and Skutella [18,17]. 

We then give an empirical evaluation of the quality of the different lower 
bounds and associated heuristics for ^rj\^WjCj-, this extends and puts in a 
broader context a recent experimental study of Savelsbergh and the authors [16]. 
We demonstrate that the stronger lower bound considered by Belouadah, Posner 
and Potts on synthetic data sets (which can be computed in O(n^) time) on aver- 
age improves on the yjt lower bound by a few percent, and that heuristics based 
on this relaxation improve by a few percent as well. However, we demonstrate 
that on most of the synthetic data sets we consider, the simple greedy heuristic 
used by Belouadah, Posner and Potts is superior to all of the heuristics based 
on the approximation algorithms associated with the different LP relaxations. 
It is only on data sets that were specifically designed to be difficult that the 
LP-based heuristics outperform the simple greedy approach. 

Finally, we note that simple local-improvement techniques are often very suc- 
cessful in giving good solutions to scheduling problems [1]; we therefore consider 
the impact of some simple local improvement techniques when applied both 
“from scratch” and to the solutions yielded by the various heuristics that we 
consider. 
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2 Background 

In this section we briefly review the relevant lower bounds and algorithms. In 
both problems we consider we have n jobs j, j = 1, . . . ,n, each with positive 
processing time pj and nonnegative weight Wj. For l|rj| ^WjCj with each job 
is associated a release date Vj before which it is not available for processing. 
LP-Relaxations: We begin with the two relevant linear-programming relax- 
ations of l\rj \ ^WjCj. As mentioned earlier, Dyer and Wolsey introduced sev- 
eral integer linear programming formulations of the problem. We focus on two. 
In the first, with variables pjt and completion-time variables Cj, yjt = 1 if job j 
is being processed in the time period [t, t -I- 1] and yjt = 0 otherwise. 

minimize 
subject to X)"=i 

T,J=iyjt = Pj, J = 

C'i = f ^ Er=i(^ + j = 1, . . . , n; 

Vjt >0, j = t = rj,...,T. 

As noted earlier, this linear program is a valid relaxation of the optimal 
preemptive schedule as well [10], and can be solved in 0(n log n) time [5]. The 
structure of the solution is in fact quite simple: at any point in time, schedule the 
available unfinished job with maximum Wj jpj (this may involve preemption) . 

In the second linear-program, which is much harder to solve, the binary 
variable Xjt for each job j {j = 1, , n) and time period t (t = pj, . . . , T), where 
T is an upper bound on schedule makespan, indicates whether job j completes 
in period t {xjt = 1) or not {xjt = 0). This relaxation is stronger than the yjt 
relaxation; in particular it is not a valid relaxation of the optimal preemptive 
schedule, and its integer solutions yield only nonpreemptive schedules. 

E TI 

j=l T,t=p^ CjtXjt 

subject to ^t=pj = 1, j = 1, . . . , n; 

W” a; ■ <1 t—1 T- 

Xjt >0, j = l,...,n, t = pj,...,T 

The assignment constraints (1) state that each job has to be completed ex- 
actly once, and the capacity constraints (2) state that the machine can process 
at most one job during any time period. 

Job Splitting: The lower bounds of Belouadah, Posner and Potts are based on 
job splitting. This technique is based on the idea that a relaxation of a nonpre- 
emptive scheduling problem may be obtained by splitting each job into smaller 
pieces that can be scheduled individually. If the objective function is ^WjCj, 
when we split job j into pieces we also must split its weight Wj among the pieces 
as well. In essence, we create a number of smaller jobs. If we split the jobs in 
such a way that we can solve the resulting relaxed problem in polynomial time, 



( 1 ) 

(2) 
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we obtain a polynomial-time computable lower bound on the optimal solution 
to the original problem. 

Note that an inherent upper bound on the quality of any lower bound 
achieved by this process is the value of the optimal preemptive schedule, as 
preemptive schedules correspond to splits in which all of the weight of the job j 
is assigned to the last piece to be scheduled, and such a weight assignment gives 
the strongest possible lower bound for a specified split of the job into pieces. 
Note however, that the preemptive version of l\rj\ ^ wjCj is also A^P-hard and 
we must settle for solving something weaker that can be computed in polynomial 
time. 

Belouadah, Posner and Potts give two lower bounds (BPPl and BPP2) based 
on job splitting. In BPPl, the pieces of job j are exactly those that arise in the 
optimal preemptive solution to the yjt relaxation, and the optimal solution to 
the resulting split problem has the same structure as that of optimal preemptive 
solution to the j/^t-relaxation. In this lower bound each piece of job j receives a 
fraction of weight Wj in exact proportion to the fraction of the size of pj that its 
size is. In the BPP2 lower bound as much weight as possible is shifted to later 
scheduled pieces of the job while maintaining the property that the structure of 
the optimal solution can be computed in polynomial time. 

Approximation Algorithms: Recent progress on approximation algorithms 
for Ijr^l '^WjCj is based on solving the linear relaxation of either of the afore- 
mentioned LPs and then inferring an ordering from the solution [14,10,7,8]. In 
this abstract we focus only on a few of these heuristics that were demonstrated 
in [16] to be the most useful computationally. 

In constructing an ordering we make use of the notion of an a-point [14,11]. 
We define the a-point of job j, 0 < a < 1, to be the first point in time, in the 
solution to a time-indexed relaxation, at which an a fraction of job j has been 
completed. We define the algorithm Schedule-by-Fixed-a, that can be applied to 
the solution of either LP relaxation, as ordering the jobs by their a-points and 
scheduling in that order. Goemans [7] has shown that for appropriate choice of 
a this is a (-\/2 -|- l)-approximation algorithm. Goemans also showed that by 
choosing a randomly according to a uniform distribution and then scheduling 
in this order, one obtains a randomized 2-approximation algorithm [7] and if 
one chooses using a different distribution, a 1.7451-approximation algorithm [8]. 
Either randomized algorithm can be derandomized by considering n different 
values of a, scheduling according to each of them, and then choosing the best. 
We call this algorithm Best-a, or, if only consider k equally-spaced values of a, we 
call the algorithm fc-Best-a. We note that it is simple to adapt these algorithms 
to be applied to combinatorial relaxations of l|rj| '^WjCj as well. 

3 Analytical Evaluation of Strength of Different Bounds 

One Machine: We first introduce some notation to describe the two BPP lower 
bounds. We say job I is “better’’ than job j if ^ < ^,or, equivalently, if ^ 

For both bounds jobs are split as follows. When a better job arrives, we split the 
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currently executing job into two pieces such that one piece completes at the 
arrival time of the new job and the the second piece is considered for scheduling 
later. When a job is split into pieces, its weight is also split. So if job j is split 
into k pieces, then each piece i has a processing time p* , a weight Wj and release 

date Tj, such that P} = Pj 

For the BPPl bound, the weights are assigned to the pieces of a job such 
that ^ ^ for all z = 1, . . . fc, whereas for BPP2 the weights are assigned to 

the pieces in a greedy fashion with the aim of maximizing the lower bound; in 
[2] they have shown that BPP2 is always greater than or equal to BPPl. 

Let us denote the lower bound given by BPPl as and the lower 

bound given by the yjt LP relaxation as 

Theorem 1. LBy^^ = LB^^^K 

Proof. As a preliminary we establish a bit more understanding of BPPl. Let 
P be the original instance with no split jobs. Let Pi denote the corresponding 
problem where one job, say j (with processing time pj, weight Wj and release 
date Tj), is split into k pieces. Say each piece i is of length p* and is assigned 

a weight Wj for z = 1, . . . ,k such that Yli=iPj ~ Pj These 

k pieces are constrained to be scheduled contiguously in Pi . The set of jobs in 
Pi are the k pieces of this split job j plus the remaining jobs in P. Obviously 
there is a one-to-one correspondence between the feasible schedules for P and 
Pi- Note that in Pi, the pieces of job j will be scheduled exactly during the 
interval P schedules job j. Therefore, it is sufficient to consider the contribution 
of this one job to the weighted completion time in both the schedules. Let the 
pieces of job j start at times tj, ... ,tj respectively. So in P, this job is scheduled 
during + Pj] and it contributes Wj * (tj +Pj)- 

The following equality was shown by Belouadah, Posner and Potts in [2]. 

k 

yjj * it] + p. ) = E (3) 

where Cj denotes the completion time of piece i of job j and 

CBRK, = E^j E Pj 

i—1 

can be thought of as the cost of breaking job j into k pieces. 

If we remove the constraint from Pi that the pieces of the job have to be 
scheduled contiguously, then the resulting problem P 2 gives a lower bound on 
the cost of Pi. This is true even if more than one job is split. Therefore, the idea 
is to split the jobs so that the optimal schedule for the resulting problem P 2 can 
be computed easily. So BPPl (and BPP2) split the jobs accordingly. 

We now turn to proving our theorem. Consider the (weighted) contribution 
of just one job, say j, to the respective lower bounds (denoted and 
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) . Let job j be released at rj with a processing time requirement of pj and 
weight Wj. Let this job be split into k pieces of lengths . . ,p’- starting at 
times tj,. . . respectively. So we have p] + p"j + ■ ■ • + Pj = Pj- BPPl would 

assign weights Wj = ^Wj for i = 1, . . . ,k. The cost of breaking job j is given by, 



fc-i 



CBRK, = E 






i—1 

k—1 k 

= E^-^5 E^^ 

i—1 ^ 

1^1 1^1 

1 1 . ^ ^ 

= 

Now the contribution of job j to the BPPl lower bound is, 

k 

PPl _ " 

^3 



Lb^ppi ^ J2wj{ti+p)) + CBRKj 



k 



= E + pj^ + l^iPi - ^ E 

2=1 



and the contribution of job j to the yjt lower bound is, 

Vjt{t + 2 ) 



LBf* = 



2 


+ 


Pj 


WjPj 




Wj 


2 




Pj 


WjPj 


+ 








' 4 ' 

Ml! 

2 ' 



“ +Pj) + 



M , WjPj Wj 



i=l 

= LB 



Pj 

BPPl 



‘^Pj 

2=1 






Summing over all jobs j we have the required result. 



□ 



As an immediate corollary we obtain an upper bound on the quality of lower 
bound provided by both BPPl and BPP2. Goemans et al. [8] proved that the 
?/jt-relaxation is a 1.685-relaxation of l|rj| P.WjCj] thus we see that BPPl and 
BPP2 are as well. We now turn to the relationship with the Xjt relaxation; it is 
known that this is stronger than the yjt [5]. 
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Theorem 2. The lower bound given by BPP2 neither always dominates nor is 
dominated by the Xjt -lower bound. 

The proof is by exhibiting two instances; one on which BPP2 is better and 
one on which Xjt is better. Due to space constraints the instances are omitted 
in this extended abstract. 

Parallel Identical Machines: In [23], Webster considers the problem 
P| I ^ WjCj He gives a series of progressively stronger lower bounds for this prob- 
lem all of which are based on ideas similar to job-splitting; these lower bounds 
lead to a successful branch-and-bound algorithm. The weakest of these bounds is 
actually due to a 1964 paper by Eastman, Even and Isaacs [6]. By an argument 
of a similar flavor to that of the one machine problem we can show that in fact 
the weakest bound is equivalent to an extension of the ?/jt-relaxation to parallel 
machines. In this extension the m machines are conceptualized as one machine 
of speed m. Schulz and Skutella show that this formulation is a 2-relaxation 
of PVj \ and a 3/2-relaxation of Pjj ^wjCj. Williamson [24] had also, 

independently, observed this equivalence between the lower bounds due to East- 
man, Even and Isaacs and the ?/jt-relaxation of P|| ^ WjCj. By establishing this 
equivalence we obtain a worst-case upper bound on the performance of all of 
Webster’s lower bounds. In fact, he considers a more general problem in which 
each processor can have non-trivial ready times - our results extend to this case 
as well. The details are omitted due to the space constraints of the extended 
abstract. 



4 Empirical Evaluation 

4.1 Experimental Design 

In [16] a comprehensive experimental evaluation is given of the relative strength 
of the yjt (BPPl) and Xjt LP relaxations and approximation algorithms based on 
them. Our goals in this section are threefold. (1) To quantify experimentally the 
relative strength of the BPP2 lower bound, which is no weaker than the BPPl 
bound but, in contrast to the Xjt-lower bound, can be computed efficiently, 
and of the approximation algorithms based on this stronger lower bound. (2) 
To compare the performance of the simple WSPT heuristic used by Belouadah 
et al. in their branch-and-bound algorithm with the LP-based approximation 
algorithms. (3) To understand the impact of local improvement on solutions to 
the problem. 

We sought to generate a rich enough set of instances to observe the fullest 
possible range of algorithm and lower bound performance. Our experimental 
design follows that both of Hariri and Potts [12] and of [16]. We worked with 
three sets of data. The first, which we call OPT, was a set of 60 instances with 
n = 30 jobs, with wj e [1..10] and pj G [1..5] in 20 instances and pj G [1..10] 
in 40 instances. For these instances we knew the optimal solutions from the 
branch-and-cut code of [20]. 
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We then generated a large Synthetic data set according to a number of 
parameters: problem size (n), distribution for random generation of the weights 
Wj, distribution for the random generation of the pj, and arrival process. The 
problem size n was chosen from {50, 100, 200, 500}. The release dates were gen- 
erated either by a Poisson process to model that, on average 2, 5, 10 or 20 jobs 
arrived every 10 units of time, (where 10 is the maximum job size) or uniformly 
in [0,^Pj]. Three distributions were used for the generation of each of the Wj 
and pj\ (i) uniform in [1, 10] (ii) normal with a mean of 5 and a standard devia- 
tion of 2.5, (iii) a two hump distribution, where, with probability 0.5 the number 
is chosen from a normal distribution with mean 2.5 and standard deviation 0.5, 
and with probability 0.5 the number is chosen from a normal distribution with 
mean 7.5 and standard deviation 0.5. Ten instances were generated randomly 
for each combination of parameters, for a total of 1800 instances. For all of these 
we computed the BPPl and BPP2 relaxations, and ran on all of these relax- 
ations the heuristics discussed in Section 2. Solutions to the Xjt relaxation were 
available for some of the instances from our work in [16]. 

Finally, as in [16] we considered the Hard data sets that were designed to 
provoke poor performance from the LP-based heuristics. All of these instances 
attempt to exploit the fact that the yjt relaxation is also a valid relaxation of 
the optimal preemptive schedule [10], and that therefore on instances for which 
the optimal preemptive schedule is much better than the optimal nonpreemp- 
tive schedule, the j/^t-based relaxation should perform poorly. Therefore these 
instances have one or several very large jobs, and a large number of tiny jobs 
that are released regularly at small intervals. In the Hardl data set we created 
instances with processing times 1 and 20 or 30. The size 1 jobs were generated on 
average 9 times more frequently than the size 20 or size 30 jobs. In the Hard2 
data set we generated job sizes in a two-hump distribution, with the humps at 
5 and 85; the size 5 jobs were generated on average 9 times more frequently. 

We note that all problem instances are available at http://ebbets.poly. 
edu/SCHED/onerj .html. 

Due to space considerations in this extended abstract we focus on a few data 
sets which turn out to be among the most difficult in their categories for the 
heuristics and that are at the same time representative of the overall behavior 
of that set. Specifically, we focus on the OPT, Hardl and Hard2, Syntheticl 
and Synthetic2. The Syntheticl set corresponds to the Synthetic set where 
release dates are generated by a Poisson process with arrival rates 2 and 5 
(n = 50, 100) and the Synthetic2 set corresponds to the release dates being 
generated uniformly (n = 50, 100,200,500). We also report results with respect 
to the average weighted flow time = ^ Wj{Cj —rj); while equivalent to average 
weighted completion time at optimality this criterion is provably much harder 
to approximate [13] and thus more interesting. 

4.2 Lower Bounds 

Table 1 reports on the relative performance of the different lower bounds. We 
note that the BPP2 lower bound does provide some improvement over the BPPl 
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bound at modest additional computational cost (O(nlogn) to 0(n^)), but since 
both are relaxations of the optimal preemptive schedule, on the Hardl data 
set the BPP2 bound is still far from the Xjt-lower bound. (Do not be misled by 
the small numbers recorded for Hard2. They are with respect to the best lower 
bound we know for that instance, which is BPP2.) Furthermore, note that on 
the Syntheticl data set sometimes BPP2 is better than the Xjt-relaxation, and 
this explains why none of the numbers in that column is 0 - the comparison 
is always with respect to the best lower bound for that instance. We note that 
the maximum improvement observed by BPP2 over BPPl on any instance was 
9.492%, on an instance in the Hardl data set. 



Table 1. Quality of the lower bounds with respect to the weighted flow time 
given by yjt, BPP2 and Xjt relaxations. We report on x 100, where 

BEST is the best available lower bound and LB is the corresponding lower 
bound. The values reported are averaged over all the instances in each case. 
Results for the yjt and Xjt relaxations are from the experimental work of Savels- 
bergh et al. 





OPT 


Hardl 


Hard2 


Syntheticl 


Synthetic2 


y.t(BPPi) 


4.506 


19.169 


0.848 


1.729 


1.848 


BPP2 


2.969 


15.539 


0.000 


0.941 


0.000 


Xjt 


1.322 


0.000 


N/A 


0.013 


N/A 



4.3 Upper Bounds 

In [16] we demonstrated that the various heuristics based on the LP-relaxations 
in general yield improved performance with improved quality of relaxation, al- 
though there are many instances for which this is not the case. Therefore it is of 
interest to study the impact of the BPP2 lower bounds on heuristic performance. 

Of greater interest is to compare the performance of all this LP “stuff” to 
the simple and naive heuristic used by Belouadah et al. in their combinatorial 
branch and bound code. This heuristic, WSPT, simply, when idle, selects for pro- 
cessing the unprocessed available job with the largest Wj/pj ratio and schedules 
it nonpreemptively. It is trivial to see that the worst-case performance of this 
heuristic is unbounded. Table 2 provides a summary of the performance of the 
various heuristics. 

The Table demonstrates that the heuristics based on the BPP2 bound do 
in fact do better by up to a few percent on the hard data sets. Most striking, 
however, is that except on the Hardl data set the WSPT heuristic is superior to 
all the LP relaxation-based approaches. Across the entire Synthetic data sets 
WSPT yields a better solution than the LP-approaches 70 — 80% of the time; 
on specific instances of the Hard data set it is up to 27% better than the best 
performance of the other heuristics. 
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Table 2. Performance of the algorithms based on the three {ujt, BPP2, Xjt) 
formulations. We report the mean, standard deviation and maximum of the ratio 
of the performance of the algorithm to the best lower bound. The first three rows 
of the table report results from the work of Savelsbergh et al. 





Hardl 


Syntheticl 


Synthetic2 




Mean 


SDev 


Max 


Mean 


SDev 


Max 


Mean 


SDev 


Max 


Schedule-by-fixed-a {yjt) 


2.207 


0.623 


3.906 


1.126 


0.093 


1.686 


1.334 


0.126 


1.892 


Best-a (yjt) 


1.683 


0.263 


2.164 


1.042 


0.042 


1.282 


1.155 


0.067 


1.371 


5-Best-a (yjt) 


1.775 


0.301 


2.244 


1.048 


0.046 


1.358 


1.171 


0.079 


1.419 


Schedule-by-fixed-a (BPP2) 


1.985 


0.507 


3.902 


1.116 


0.084 


1.643 


1.292 


0.109 


1.615 


Best-a (BPP2) 


1.628 


0.245 


2.122 


1.040 


0.039 


1.233 


1.144 


0.059 


1.311 


5-Best-a (BPP2) 


1.710 


0.274 


2.296 


1.046 


0.043 


1.233 


1.157 


0.067 


1.356 


Best-cn {xjt) 


1.854 


0.320 


2.428 


1.048 


0.045 


1.303 


- 


- 


- 


WSPT 


1.874 


0.372 


3.015 


1.033 


0.032 


1.148 


1.121 


0.048 


1.307 



In some sense this performance is quite surprising, but in another it is not very 
surprising at all. WSPT, while lacking any worst-case performance guarantee, is 
a natural choice that should work well “most of the time” . It is likely that when 
one generates synthetic data one is generating instances that one would think 
of “much of the time” . It is on the harder sorts of instances that the potential 
problems with such a heuristic arise. 

Although it would not be difficult to generate instances that would yield 
horrendous performance for WSPT the purpose of an experimental study is to 
yield insight into algorithm performance on instances that may be somewhat 
natural. Towards this end we generated a spectrum of problems in the framework 
of “a few jobs of size P and lots of jobs of size 1”, with P ranging from 1 to 
1000. The results of the experiment, plotted in Figure 1, gives a qualitative 
sense of the range in which WSPT is competitive with algorithms with worst- 
case performance guarantee, which is essentially up to P = 100 or so. 

Finally, we experimented with the impact of local improvement, which can 
be a very powerful algorithmic technique for certain scheduling problems. Specif- 
ically, given a solution we considered all pairs and all triples of jobs, switching 
any that led to an improved solution and iterating until no improving switch 
existed. We applied these to the solutions of all the heuristics. For the heuristics 
that considered several possible schedules we applied local improvement to all 
of them and took the best resulting schedule. The results, reported in Table 3, 
indicate that the combination of local improvement applied to a collection of 
high-quality schedules yield the best results. We note further that applying this 
local improvement from scratch (random orderings) was not competitive with 
its application to a good initial schedule; the latter approach on average yielded 
solutions of quality 40-50% better. The importance of the quality of the initial 
schedule for local improvement was also observed by Savelsbergh et al [16] in 
the case of a real resource constrained scheduling problem. 
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Fig. 1. Plot of ratios for for Best-a, 5-Best-a and WSPT. Each 

data point is an average of 10 instances. On the x-axis we plot the maximum 
job size Pmax- Pmax = k means that the processing time was generated so that 
p = 1 with a probability of 0.9 and p = k with a probability of 0.1. The arrival 
times were generated uniformly in the range [0..500] 



5 Conclusions 

Enthusiasts of worst-case analysis may be disappointed by the relatively poor 
showing of the approximation algorithms with respect to the simple WSPT 
heuristic. They may be consoled by their advantages on certain classes of diffi- 
cult problems and their supremacy when coupled with local improvement. We 
suggest, however, that although it is interesting to compare these approaches on 
simple models, perhaps the most important role for worst-case analysis is as a 
tool that forces the algorithm designer to have new ideas that may also be useful 
in other more complex problems^. Savelsbergh et al. showed that the idea of us- 
ing many a values led to improved solutions for actual complex manufacturing 
scheduling problems; for these problems it is unlikely that there exist simple 
heuristics that will perform as well in such settings. 

Finally, in this study we have touched on several approaches to scheduling 
problems: linear programming, approximation, combinatorial relaxations and 
branch-and-bound, and local improvement, and made a modest contribution to 
our further understanding of their relationship. We feel that it is an important 
direction to continue to try to understand in a unified fashion the different roles 
that these elements can play in the theory and practice of scheduling. 
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Table 3. Performance of the algorithms based on the three {ujt, BPP2, Xjt) 
formulations after some local improvement. We report the mean, standard devi- 
ation and maximum of the ratio of the performance of the algorithm to the best 
lower bound. 





Hardl 


Syntheticl 


Synthetic2 




Mean 


SDev 


Max 


Mean 


SDev 


Max 


Mean 


SDev 


Max 


Schedule-by-fixed-a {yjt) 


1.594 


0.264 


2.137 


1.088 


0.061 


1.330 


1.200 


0.073 


1.438 


20-Best-a {vjt) 


1.402 


0.148 


1.679 


1.022 


0.022 


1.138 


1.096 


0.035 


1.196 


5-Best-a (yjt) 


1.420 


0.164 


1.770 


1.024 


0.023 


1.140 


1.100 


0.037 


1.213 


Schedule-by-fixed-a (BPP2) 


1.591 


0.270 


2.141 


1.081 


0.058 


1.339 


1.175 


0.066 


1.371 


20-Best-a (BPP2) 


1.398 


0.133 


1.624 


1.022 


0.022 


1.143 


1.096 


0.035 


1.191 


5-Best-a (BPP2) 


1.413 


0.142 


1.652 


1.024 


0.023 


1.143 


1.100 


0.036 


1.213 


20-Best-a {xjt) 


1.429 


0.197 


1.871 


1.021 


0.022 


1.125 


- 


- 


- 


WSPT 


1.746 


0.284 


2.321 


1.031 


0.030 


1.147 


1.110 


0.042 


1.241 
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Abstract. For the classical quadratic assignment problem (QAP) that 
requires n objects to be assigned to n locations (the n x n-case), polyhe- 
dral studies have been started in the very recent years by several authors. 
In this paper, we investigate the variant of the QAP, where the number 
of locations may exceed the number of objects (the m x n-case). It turns 
out that one can obtain structural results on the m x n-polytopes by ex- 
ploiting knowledge on the n x n-case, since the first ones are certain pro- 
jections of the latter ones. Besides answering the basic questions for the 
affine hulls, the dimensions, and the trivial facets of the m x n-polytopes, 
we present a large class of facet defining inequalities. Employed into a 
cutting plane procedure, these polyhedral results enable us to compute 
optimal solutions for some hard instances from the QAPLIB for the first 
time without using branch-and-bound. Moreover, we can calculate for 
several yet unsolved instances significantly improved lower bounds. 



1 Introduction 

Let a set of m objects and a set of n locations be given, where m < n. We will 
be concerned with the following problem. Given linear costs for assigning 
object i to location j and quadratic costs q{(ij),{k,i)} for assigning object i to 
location j and object k to location I, the task is (in the non- symmetric case) 
to find an assignment, i.e., an injective map ip : {!,..., m} — > {!,..., n}, that 
minimizes 

m 

E 

2=1 2=1 






I H-i-Mi)) 
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In the symmetric case we have quadratic costs for assigning the two 

objects i and k anyhow to the two locations j and I, and we have to find an 
assignment Lp : {1, . . . , m} — > {1, . . . , n} that minimizes 

mm m 

9({i,fe}.{<p(i).v(fe)}) fo ■ 

i—1 k=i-\-l 2=1 




The classical quadratic assignment problem (introduced by Koopmans and 
Beckmann [10]) is a special case of the non-symmetric formulation, where m = n 
and d{(ij),(fe,i)} = fikdji + fkidij holds for some flow-matrix (fik) and some 
distance-matrix (dji). If one of the flow- or distance-matrix is symmetric then 
the problem is also a special case of the symmetric formulation given above. 

The QAP is not only from the theoretical point of view a hard one among the 
classical combinatorial optimization problems (Sahni and Gonzales [14] showed 
that even e-approximation is AfT^-hard), but it has also resisted quite well most 
practical attacks to solve it for larger instances. 

Polyhedral approaches to the classical case with m = n (the n x n-case) have 
been started during the recent years by Padberg and Rijal ([13,11]) as well as 
Jiinger and Kaibel ([4, 6, 5, 8]). In [5] the first large class of facet defining inequal- 
ities for the associated polytopes is presented. These inequalities turned out to 
yield very effective cutting planes that allowed to solve for the first time several 
instances from the QAPLIB (the commonly used set of test instances compiled 
by Burkarhd, Karisch, and Rendl [2]) to optimality without using branch-and- 
bound. 

In this paper, we describe a polyhedral approach to the case where the num- 
ber of objects m might be less than the number of locations n (the m x n-case). 
We restrict our presentation to the symmetric version. In Sect. 2 a problem 
formulation in terms of certain hypergraphs is introduced and the associated 
polytopes are defined. The trivial facets as well as the affine hulls of these poly- 
topes are considered in Sect. 3. In Sect. 4 rather tight relaxation polytopes are 
presented that are projections of certain relaxation polytopes for the n x n-case. 
In Sect. 5 we describe a large class of facet defining inequalities for the polytopes 
that are associated with the (symmetric) m x n-case. Strengthening the relax- 
ations of Sect. 4 by some of these inequalities in a cutting plane procedure, we 
can improve for some m x n-instances in the QAPLIB the lower bounds signifi- 
cantly. Moreover, we can solve several m x n-instances by a pure cutting plane 
procedure. Results of these experiments are given in Sect. 6. We conclude with 
some remarks on promising further directions of polyhedral investigations for 
the m X n-QAP in Sect. 7. 



2 QAP-Polytopes 

As indicated in the introduction, we restrict to the symmetric QAP in this paper. 
Since it provides convenient ways to talk about the problem, we first formulate 
the symmetric QAP as a problem defined on a certain hypergraph. 
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Throughout this paper, let m < n, A4 := {1, . . . , m}, and Af := {1, . . . , n}. 
We define a hypergraph Qm,n '■= (Vm,n, £m,n) on the nodes Vm,n '■= A4xAf with 
hyperedges 

^m,n ■■= G k,j ^ z| . 

A hyperedge {{i,j), {k, 1), {i, 1), (k,j)} is denoted by {i,j, k, 1). The sets row^ := 
{{i,j) G Vm,n ■ j G Af} and colj := {(z, j) G Vm,n ■ i G A4} are called the i-th 
row and the j-th column of Vm,n, respectively. 

We call a subset C C Vm,n of nodes a clique of the hypergraph Qm,n if it 
intersects neither any row nor any column more than once. The maximal cliques 
of Gm,n are the m-cliques. The set of hyperedges that is associated with an m- 
clique C C Vm,n of Gm,n consists of all hyperedges that share two nodes with C. 
This set is denoted by £m,n{C). Solving symmetric QAPs then is equivalent to 
finding minimally node- and hyperedge- weighted m-cliques in Qm,n- 

We denote by and the characteristic vectors 

of subsets of Vm,n and £m,n, respectively. Thus the following polytope encodes 
the structure of the symmetric QAP in an adequate fashion (where we simplify 

SQAVm,n '■= conv |(x^, z^) : C is an m-clique of Gm,n'^ 

The (mixed) integer linear programming formulation in Theorem 1 is quite 
basic for the polyhedral approach. Let be the set of all hyperedges that 

contain both nodes (i,j) and (k,l). As usual, for any vector u £ and any 
subset L' <£ L oi indices we denote u{L') := 

Theorem 1. Let 1 < m < n. A vector (x,z) £ x is a vertex of 

SQAVm,n, i-e., the characteristic vector of an m-clique of Qm,n, if and only if 



it satisfies the following conditions: 

a;(rowj) = 1 {i £ A4) (1) 

x(colj) <1 {j £ Af) (2) 

- X(ij) - X(fej) -b z =0 {i,k & M,i < k,j £ Af) (3) 

> 0 {h £ £rn,n) (4) 

Xy £ {0, 1} (zz £ 'Vrn,n) (b) 



Proof. The “only if part” is obvious. To prove the “if part” , let {x, z) G x 

satisfy (1), . . . ,(5). Since a; is a 0/1-vector that satisfies (1) and (2) it must 
be the characteristic vector of some m-clique C C Vm,n of Gm,n- Considering 
two appropriate equations from (3), one obtains (by the nonnegativity of z) that 
> 0 implies X(ij) = X(^k,i) = 1 or X(tg) = X(k,j) = 1- But then, in each 
of the equations (3) there is at most one hyperedge involved that corresponds 
to a non-zero component of z. This leads to the fact that > 0 implies 
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Z{ij,k,i) = 1) and that = a;(fc = 1 implies = 1. Hence, z must be 

the characteristic vector of Sm,n{C) □ 

How is the n x n-case related to the m x n-case? Obviously, SQAVm,n arises 
from5Q^7^„_„ by the canonical orthogonal projection xK'^">" — > 

^ Wrn,n ~ Vn,n \ Vm,n and Frn,n ~ { (b Jj ^5 0 ^ ^n,n ■ (b jj ^7 0 

Wm,n ^ 0} be the sets of nodes and hyperedges that are “projected out” this 
way. The following connection is very useful. 

Remark 1. If an inequality (a,b)’^ {x, z) < a defines a facet of SQAVn,n and 
a„ = 0 holds for all v e Wm,n as well as 6/j = 0 for all h € Tjn.n, then the 
“projected inequality” (a' ,b')'^ (x' , z') < a with {a',b') = a^™’’^^(a,b) defines a 
facet of SQAVm,n- 

The following result shows that in order to investigate the m x n-case with 
m < n it suffices to restrict even to rn < n — 2. In fact, it turns out later that 
the structures of the polytopes for m < n — 2 differ a lot from those for m = n 
or m = n — 1. 

Theorem 2. For n > 2 the canonical orthogonal projection x 

jjfn.n — > ^ jj£n-i,n induccs On isomorphism between the polytopes 

SQAVn,n and SQAVn-l,n- 

Proof. In the n x n-case, in addition to (I) and (3) the equations 

a;(colj) = I (j G Af) (6) 

- j) - -b z -Iq ) = 0 {ij,l eAf,j <l) (7) 

are valid for SQAVn.n- Obviously, the columns of these equations that corre- 
spond to nodes in or to hyperedges in Fn-i,n are linearly independent. 

This implies the theorem. □ 

3 The Basic Facial Structures of SQ,AfPm,n 

The questions for the affine hull, the dimension, and the trivial facets are an- 
swered by the following theorem. 

Theorem 3. Let 3 < m < n — 2. 

(i) The affine hull of SQAVm.n is precisely the solution space of (1) and (3). 

(a) SQAVm,n has dimension dim(K''^’”'" x — {m + mn{m— l)/2). 

(Hi) The nonnegativity constraints (x,z) > 0 define facets of SQAVm.n- 
(iv) The inequalities (x,z) < 1 are implied by the (1) and (3) together with 
the nonnegativity constraints (x,z) > 0. 
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Proof. Part (iv) is a straightforward calculation. While (iii) follows immediately 
from Remark 1 and the fact that the nonnegativity constraints define facets in 
the n X n-case (see [4]), part (i) (which implies (ii)) needs some more techniques, 
which are not introduced here due to the restricted space. They can be found in 
detail in [8]. The key step is to project the polytope isomorphically into a lower 
dimensional vector space, where the vertices have a more convenient coordinate 
structure. □ 

4 Projecting a Certain Relaxation Polytope 

In [4] it is proved that the affine hull of the polytope SQAVn,n (the n x n-case) 
is described by (1), (3), (6), and (7), i.e., a,S{SQAVn,n) is the solution space 
of the following system (where we use capital letters (AT, Z) in order to avoid 



confusion between the n x n- and the m x n-case): 

A"(rowj) = 1 (i e Af) (8) 

X(col,) = l (jGAf) (9) 

- + Z = 0 (z, j, k A <k) (10) 

- + Z = 0 (z, j, / G A7, j < 0 (11) 



The fact that (9) and (11) are needed additionally to describe the affine hull of 
the polytope in the n x n-case is the most important difference to the m x n-case 
with m < n — 2. 

It turned out ([12,8]) that minimizing over the intersection SSQVn.n of 
aA{SQAVn,n) and the nonnegative orthant empirically yields a very strong lower 
bound for the symmetric n x n-QAP. In contrast to that, minimizing over the in- 
tersection S£QVm,n of a,S{SQAVm,n) and the nonnegative orthant usually gives 
rather poor lower bounds (for m < n — 2). However, solving the corresponding 
linear programs is much faster in the m x n case (as long as m is much smaller 
than n). 

In order to obtain a good lower bound also in the mxrz-case, one could add n— 
m dummy objects to the instance and after that calculate the bound in the nxn- 
model. Clearly it would be desirable to be able to compute that bound without 
“blowing up” the model by adding dummies. The following result provides a 
possibility to do so, and hence, enables us to compute good lower bounds fast in 
case of considerably less objects than locations. One more notational convention 
is needed. For two disjoint columns colj and col; of Gm,n we denote by (colj : col;) 
the set of all hyperedges that share two nodes with col^ and two nodes with col/. 

Theorem 4. Let 3 < m < n — 2. A point (x,z) G x is contained 

in a'^"^’^\S£QPn,n) if and only if it satisfies the following linear system: 

x{rowi) = 1 (z G M) (12) 

a;(colj) <1 (j G Af) (13) 

- - X(fcj) -b z =0 (i,k&A4,i<k,j£Af) (14) 
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(15) 


x{co\j U col;) — z{{co\j : col;)) < 1 


(j,^ G AT,j < 1) 


(16) 


> 0 


{y G 


(17) 


Zh>Q 


{h G ^m,n) 


(18) 



Proof. It should be always clear from the context whether a symbol like row^ is 
meant to be the i-th row of Vn,n or of Vm,n- The rule is that in connection with 
variables denoted by lower-case letters always Vm,n is the reference set, while 
variables denoted by upper-case letters refer to 

We shall first consider the “only if claim” of the theorem. Let {X, Z) € 
SSQVn.n, and let {x,z) = Z) be the projections of {X, Z). Obviously, 

(12), (14), and the nonnegativity constraints hold for {x, z). The inequalities (13) 
and (15) follow from (9), (11) and the the nonnegativity of (X,Z). 

It remains to show that the inequalities (16) are satisfied by {x,z). The 
equations 



X(colj U col;) — Z{{colj : col;)) = 1 (j, I £ Af,j < 1) (19) 

hold, since (X,Z) G a,S{SQAVn,n) and they are easily seen to be valid for 
SQAVn.n- For j e Af we denote 

Qj := {(z, j) : 1 < z < m} and Qj := {{i, j) : m < i < n} . 

Adding some equations of (11) (the ones with z > m) to (19) yields equations 

X{Q,) + X{Qi) - Z{{Q, : Q;)) + Z{{Q, : Qi)) = 1 {j,l £ U,j < 1) . (20) 



Thus, by the nonnegativity of Z, the inequalities (16) hold for the projected 
vector (x,z). 

We come to the more interesting “if claim”. In order to show that the 
given system (12), ... ,(18) of linear constraints forces the point (x,z) to be 
contained in the projected polytope a^'^''^\SEQVn,n), we shall exhibit a map 
(j) : X > X that maps such a point (x,z) satisfying 

(12), . . . ,(18) to a point {X, Z) = <p{x, z) £ S£QVn,n which coincides with (x, z) 
on the components belonging to Qm,n (as a subgraph of Qn,n)- Hence, the first 
step is to define (X, Z) = 4>{x,z) as an extension of {x,z), and the second step 
is to prove that this {X, Z) indeed satisfies (8), . . . ,(11), as well as {X, Z) > 0. 
The following extension turns out to be a suitable choice (recall that m < n — 2): 









1 — a:(colj) 
n — m 

n — m 

2^1 — a;(colj U col;) -I- z ((col^ : col;)) ^ 
{n — m — l){n — m) 



(z > m) 

(z < TO, /c > m) 

(z, k > m) 



( 21 ) 

(22) 



(23) 
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Let (x, z) G X satisfy (12), . . . ,(18), and let {X, Z) = (j){x, z) be the 

extension defined by (21), . . . ,(23). Clearly, X is nonnegative (by (13)) and Z 
is nonnegative (by (15) for i < m, k > m and by (16) for i,k > m). 

The validity of (8), ... ,(11) for {X,Z) is shown by the following series of 
calculations. We use the notation 

:= {h e Sm,n ■ (i,j) G h} 

for the set of all hyperedges of Gm,n that contain the node (z,j). Note that by 
(14) we have 



^ (^(b j)) = {m- 2)x(ij) + x(colj) . 



Equations (8): For i <m this is clear, and for i > m we have 



X(rowi) ^ = ^ ^l^x^ 

71 — m 






^ (n- ^x(colj)) 
^ (n - ^ x(row,)) 



ieM 



(IJ) ^ 



(24) 



Equations (9): For j G Af we have 

( 21 ) 



X(colj) = x(colj) + (n — m) 

= 1 . 



1 — x(colj) 



n — m 



Equations (10): For i,k < m these equations clearly hold. In case of i < m and 
k > m we have 






V (bi)J 



(21M22) l-x(colj) ^ 

“ ^(hi) „ _ ^ ^ 2^ 






n — m 



ieX\j 



= -X(ij) + ^ ^ (a^(coli) - f + {n- 2)xpj) + x(rowi) - z{A{i,j 

( 12 ), ( 24 ) 1 , , 

= -X(i 4) H [n - m)xu j) 

n — m 



= 0 . 
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It remains to consider the case i,k > m. Here, we get (using a := 
in order to increase readability) 

(23) 

= - X(k,j) + 2a ^ (1 - a;(colj U colj) + z{{co\j : col;))) 

+2a(n - 1 - (n - 2)x(colj) - x{Vm,n) + ^ ^z{A{i,j))"j 

ieM 

(12), (24) / . 1 

= — A(jj) — +2a(n — m— 1 + (to — n + l)a:(colj) I 

2 

= ~ ^{k,j) + ^ _ ^(1 ~ 3;(colj)) 

(21) 

= ’ 0 . 

Equations (11): If i <m holds, then we have 



^(ij) 



~ X(ij) + z 




( 2 _ 2 ) _ 






E 






E 






kGA4\i 






( 4 il) 



E 

keAf\M 



kej\t\M 

^(hO ^ 



n — m 




= 0 . 



For i > TO we have 





X(i,i) + z 


{ 411 ) 


1 


= 




X{i,i) + 


E 








keM 


(22)p3) 




X{i,i) + 


E 








kGM 




+2 


E 


1-; 



k^{Af\M)\i 



keM\M\i 

a^(M)+a^(fco)-^(^(fc;S) 

n — m 

x{colj U col/) + z((colj : col 
(n — m — l)(n — to) 
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= ^ ^ (a:(col/ + x(colj) - 2z((col; : colj 

2 / 

H ( 1 — x(coL- U col;) + 2 ((coL- : col;' 

n-m\ 



' ' ' ^ n — m \ 

( 21 ) 

= ’ 0 . 



□ 

Finally, we investigate the system (12), . . . ,(18) with respect to the question 
of redundancies. From Theorem 3 we know already that the nonnegativity con- 
straints (17) and (18) define facets of SQAVm,n as well as that (12) and (14) 
are needed in the linear description of the affine hull of the polytope. Thus it 
remains to investigate (13), (15), and (16). And in fact, it turns out that one of 
these classes is redundant. 

Theorem 5. Let 4 < m < n — 2. 

(i) The inequalities (13) are redundant in (12), ... ,(18). 

(a) The inequalities (15) and (16) define facets of SQAVm,n- 

Proof. To prove part (i) observe that the equations 

(m - l)a; (col^) - 2 ((colj : Vm,n \ colj)) = 0 (j G Af) (25) 

and 

x{Vm,n)=m (26) 

hold for SQAVm.n- Thus, they are implied by the linear system (12), (14) due 
to Theorem 3. Adding up all inequalities (16) for a fixed j G Af and alH G AT \j 
and subtracting (25) (for that j) and (26) yields 

n — m — 1 > ^ ^a;(colj U col;) — z((colj : col;))^ 

-(to - l)a:(colj) -I- z{{colj : \ colj)) - x{Vm,n) 

= (n - 1) a; (colj) + ^ a; (col;) - 2 ((colj : \ colj)) 

-(to - l)a;(colj) -I- z((colj : V^.n \ colj)) - x{Vm,n) 

= {n — m— l)a;(colj) , 

what proves part (i) due ton — to— 1>1. 

The proof of (ii) needs the same techniques as mentioned in the proof of 
Theorem 3, and is omitted here as well. It can be found in [8]. □ 
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5 A Large Class of Facets 

In [5] a large class of facet defining inequalities for the n x n-case is investigated. 
Many of them satisfy the requirements stated in Remark 1. We briefly introduce 
these inequalities here, and demonstrate in Sect. 6 how valuable they are for 
computing good lower bounds or even optimal solutions for QAPs with less 
objects than locations. 

Let P\,P 2 C M and Qi,Q 2 C Af be two sets of row respectively column 
indices with n P 2 = 0 and Qi n Q 2 = 0- Define S := (Pi x Qi) U (P 2 x Q 2 ) 
and T := (Pi x Q 2 ) U (P 2 x Qi). In [5] it is shown that the following inequality 
is valid for SQAVn,n for every /? G Z (where z{S) and z{T) are the sums 
over all components of 2 ; that belong to hyperedges with all four endnodes in S 
respectively in T, and {S : T) is the set of all hyperedges with two endnodes in 

5 and the other two endnodes in T) : 

- Px{S) + {(3- l)x(T) - z(5) - z{T) + z{{S : T)) < (27) 

Clearly the validity carries over to the m x n-case. The inequality (27) is called 
the 4~box inequality determined by the triple (5, T, /3). 

In this paper, we concentrate on 4-box inequalities that are generated by a 
triple (0,T, /3) (i.e., we have Pi = Q 2 = 0 or P 2 = Qi = 0), which we call 1- 
box inequalities. Empirically, they have turned out to be the most valuable ones 
within cutting plane procedures among the whole set of 4-box inequalities. 

In [5], part (i) of the following theorem is proved, from which part (ii) follows 
immediately by Remark 1. 

Theorem 6. Let n> 7. 

(i) Let P,Q C J\f generate T = P x Q C Vn,n, and let (3 G Z be an integer 
number such that 

• P> 2, 

• \P\,\Q\>f3 + 2, 

• 1^*1) \Q\ — 3, and 

• \P\ + \Q\<n + f3-5 
hold. Then the 1-box inequality 

{(3-1)x{T)-z{T)<^^^^ (28) 

defined by the triple fb,T,l3) defines a facet of SQAVn.n- 

(ii) For m < n and P C Ai the 1-box inequality (28) defines a facet of 
SQAVm.n as well. 

6 Computational Results 

Using the ABACUS framework (Jiinger and Thienel [7]) we have implemented a 
simple cutting plane algorithm for (symmetric) m x n-instances (with m < n — 2) 
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that uses (12), (14), . . . ,(18) as the initial set of constraints. Thus, by Theorem 4 
(and Theorem 5 (i)), the first bound that is computed is the symmetric equa- 
tion bound (SEQB), which is obtained by optimizing over the intersection of 
a,S{SQAPn,n) with the nonnegative orthant. 

The separation algorithm that we use is a simple 2-opt based heuristic for 
finding violated 1-box inequalities with (3 = 2. We limited the experiments to 
this small subclass of box inequalities since on the one hand they emerged as 
the most valuable ones from initial tests, and on the other hand even our simple 
heuristic usually finds many violated inequalities among the 1-box inequalities 
with (3 = 2 (li it is called several times with different randomly chosen initial 
boxes T). 

The experiments were carried out on a Silicon Graphics Power Challenge 
computer. For solving the linear programs we used the barrier code of CPLEX 
4.0, which was run in its parallel version on four processors. 

We tested our code on the esc instances of the QAPLIB, which are the 
only ones in that problem library that have much less objects than locations. 
Note that all these instances have both a symmetric (integral) flow as well as 
a symmetric (integral) distance matrix, yielding that only even numbers occur 
as objective function values of feasible solutions. Thus, every lower bound can 
be rounded up to the next even integer number greater than or equal to it. 
Our tests are restricted to those ones among these instances that have 16 or 32 
locations. All the escl6 instances (with 16 locations) were solved for the first 
time to optimality by Clausen and Perreghard [3]. The esc32 (with 32 locations) 
instances are still unsolved, up to three easy ones among them. 

Table 1 shows the results for the escl6 instances. The instances escl6b, 
escl6c, and escl6h are omitted since they do not satisfy m < n — 2 (escl6f 
was removed from the QAPLIB since it has an all-zero flow matrix). The bounds 



Table 1. The column objects contains the number of objects in the respective 
instance, opt is the optimal solution value, SEQB is the symmetric equation 
bound (i.e., the bound after the first LP), box is the bound obtained after some 
cutting plane iterations, LPs shows the number of linear programs being solved, 
time is the CPU time in seconds, and speed up is the quotient of the running 
times for working in the n x n- and in the m x n-model. The last column gives 
the running times Clausen and Perregard needed to solve the instances on a 
parallel machine with 16 i860 processors. 



name 


objects 


opt 


SEQB 


box 


LPs 


time (s) 


speed up 


CP (s) 


escl6a 


10 


68 


48 


64 


3 


522 


4.87 


65 


escl6d 


14 


16 


4 


16 


2 


269 


2.74 


492 


escl6e 


9 


28 


14 


28 


4 


588 


3.37 


66 


escl6g 


8 


26 


14 


26 


3 


58 


14.62 


7 


esc 161 


9 


14 


0 


14 


4 


106 


28.18 


84 


escl6j 


7 


8 


2 


8 


2 


25 


32.96 


14 
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produced by our cutting plane code match the optimal solution values for all 
instances but esclSa (see also Fig. 1). Working in the m x n-model speeds up 
the cutting plane code quite much for some instances. The running times in the 
m X n-model are comparable with the ones of the branch-and-bound code of [3] . 




esc16a esc16d esc16e esc16g esc16i esc16j 



Fig. 1. The bars (gray for SEQB and black for the bound obtained by the box 
inequalities) show the ratios of the lower and upper bounds, where the upper 
bounds are always the optimal solution values here. 



For the esc32 instances, our cutting plane algorithm computes always the 
best known lower bounds (see Tab. 2 and Fig. 2). The three instances esc32e, 
esc32f, and esc32g were solved to optimality for the first time by Briingger, 
Marzetta, Clausen, and Perregard [1]. Our cutting plane code is able to solve 
these instances to optimality within a few hundred seconds of CPU time (on 



Table 2. The column labels have the same meanings as in Tab. 1. Additionally, 
upper gives the objective function value of the best known feasible solution and 
prev lb denotes the best previously known lower bound. (Running times with a 
* are only approximately measured due to problems with the queuing system of 
the machine). 



name 


objects 


upper 


prev lb 


SEQB 


box 


LPs 


time (s) 


esc32a 


25 


130 


36 


40 


88 


3 


62988 


esc32b 


24 


168 


96 


96 


100 


4 


★60000 


esc32c 


19 


642 


506 


382 


506 


8 


★140000 


esc32d 


18 


200 


132 


112 


152 


8 


★80000 


esc32e 


9 


2 


2 


0 


2 


2 


576 


esc32f 


9 


2 


2 


0 


2 


2 


554 


esc32g 


7 


6 


6 


0 


6 


2 


277 


esc32h 


19 


438 


315 


290 


352 


6 


119974 
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four processors). These are about the same running times as needed by [1] with 
a branch-and-bound code on a 32 processor NEC Cenju-3 machine. 

The formerly best known lower bounds for the other esc32 instances were 
calculated by the triangle decomposition bounding procedure of Karisch and 
Rendl [9]. The bounds obtained by the cutting plane code improve (or match, in 
case of esc32c) all these bounds. The most impressive gain is the improvement 
of the bound quality from 0.28 to 0.68 for esc32a. While for the escl6 instances 
switching from the n x n- to the m x n-model yields a significant speed up, in case 
of the esc32 instances to solve the linear programs even became only possible 
in the m x n-model. 

Nevertheless, for the hard ones among the esc32 instances the running times 
of the cutting plane code are rather large. Here, a more sophisticated cutting 
plane algorithm is required in order to succeed in solving these instances to 
optimality. This concerns the separation algorithms and strategies, the treatment 
of the linear programs, as well as the exploitation of sparsity of the objective 
functions, which will be briefly addressed in the following section. 




esc32a esc32b esc32c esc32d esc32e esc32f esc32g esc32h 



Fig. 2. The dark gray and the black bars have the same meaning as in Fig. 1. 
Additionally, the light gray bars show the qualities of the previously best known 
lower bounds. 



7 Conclusion 

The polyhedral studies reported in this paper have enabled us to build for the 
first time a cutting plane code for QAPs with less objects than locations that 
has a similar performance as current parallel branch-and-bound codes for smaller 
instances and gives new lower bounds for the larger ones. More elaborated sepa- 
ration procedures (including parallelization) and a more sophisticated handling 
of the linear programs will surely increase the performance of the cutting plane 
algorithm still further. 

At the moment, the limiting factor for the cutting plane approach is the size 
(and the hardness) of the linear programs. But if one considers the instances 
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in the QAPLIB more closely, it turns out that the flow matrices very often are 
extremely sparse. If one exploits this sparsity, one can “project out” even more 
variables than we did by passing from the n x n- to the m x n-model. In our 
opinion, investigations of the associated projected polytopes will eventually lead 
to cutting plane algorithms in much smaller models, which perhaps will push 
the limits for exact solutions of quadratic assignment problems far beyond the 
current ones. 
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Abstract. Semidefinite relaxations of quadratic 0-1 programming or 
graph partitioning problems are well known to be of high quality. How- 
ever, solving them by primal-dual interior point methods can take much 
time even for problems of moderate size. Recently we proposed a spec- 
tral bundle method that allows to compute, within reasonable time, ap- 
proximate solutions to structured, large equality constrained semidehnite 
programs if the trace of the primal matrix variable is fixed. The latter 
property holds for the aforementioned applications. We extend the spec- 
tral bundle method so that it can handle inequality constraints without 
seriously increasing computation time. This makes it possible to apply 
cutting plane algorithms to semidefinite relaxations of real world sized 
instances. We illustrate the efficacy of the approach by giving some pre- 
liminary computational results. 



1 Introduction 

Since the landmark papers [9,4,10,2] it is well known that semidefinite pro- 
gramming allows to design powerful relaxations for constrained quadratic 0-1 
programming and graph partitioning problems. The most commonly used algo- 
rithms for solving these relaxations, primal-dual interior point methods, offer 
little possibilities to exploit problem structure. Typically their runtime is gov- 
erned by the factorization of a dense symmetric positive definite matrix in the 
number of constraints and by the line search that ensures the positive definite- 
ness of the matrix variables. Computation times for problems with more than 
3000 constraints or matrix variables of order 500, say, are prohibitive. Very re- 
cently, a pure dual approach has been proposed in [1] that is able to exploit the 
sparsity of the cost matrix in the case of the max-cut relaxation with diagonal 

R. E. Bixby, E. A. Boyd, and R. Z. Rfos-Mercado (Eds.): IPCO VI 
LNCS 1412, pp. 423-435, 1998. © Springer-Verlag Berlin Heidelberg 1998 
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constraints. It is not yet clear whether these results extend to problems with a 
huge number of less structured constraints. 

The spectral bundle method [6] works on a reformulation of semidefinite re- 
laxations as eigenvalue optimization problems and was developed to provide ap- 
proximate solutions to structured problems fast. In contrast to standard bundle 
methods [7,11], a non-polyhedral semidefinite cutting plane model is constructed 
from the subgradients. Reinterpreted in terms of the original semidefinite pro- 
gram, the semidefinite model ensures positive semidefiniteness of the dual matrix 
variable on a subspace only. Subgradients correspond to eigenvectors to negative 
eigenvalues and are used to correct the subspace. By means of an aggregate sub- 
gradient the dimension of the subspace can be kept small, thus ensuring efficient 
solvability of the subproblems. Lanczos methods (see e.g. [3]) allow to compute a 
few extremal eigenvalues and their eigenvectors efficiently by a series of matrix- 
vector multiplications which do not require the matrix in explicit form. Thus 
structural properties of cost and coefficient matrices can be exploited. 

Like most first order methods, the spectral bundle method exhibits fast 
progress in the beginning, but shows a strong tailing off effect as the optimal 
solution is approached. Fortunately many semidefinite relaxations do not have 
to be solved exactly. Rather an approximate solution is used to improve, e.g., by 
cutting planes, the current relaxation, which is then resolved. 

In its original form the spectral bundle method is designed for equality con- 
straints only because sign constraints on the dual variables may increase com- 
putation times for the semidefinite subproblem significantly. In this paper we 
employ Lagrangian relaxation to approximate the solution of a sign constrained 
semidefinite subproblem. Surprisingly just one update of Lagrange multipliers 
per function evaluation suffices to ensure convergence. The semidefinite subprob- 
lem can be solved as efficiently as in the unconstrained case, thus rendering this 
method an attractive choice for large scale semidefinite cutting plane algorithms. 

Section 2 introduces some notation and explains the connection to semidefi- 
nite programming. This is followed by a very brief review of important properties 
of the maximal eigenvalue function. Section 4 explains the extension of the spec- 
tral bundle method to inequality constraints. In Section 5 we discuss efficiency 
aspects of the subproblem solution. Section 6 gives computational results. 

2 Semidefinite Programs 

Let Sn denote the space of symmetric matrices of order n. The inner product 
in this space is the usual matrix inner product, (A,B) = tr{B^A) for A,Bg 
IR™x". Let 5+ denote the set of symmetric positive semidefinite matrices. 5+ 
is a pointed closed convex cone. Except for its apex {0}, a face F of this cone 
can be described as 

F = {pvp'^ : y e 5+} , 

where P G IR"^’’ is some fixed matrix with orthonormal columns (w.l.o.g.). The 
dimension of such a face F is For A, B € Sn, APB refers to the Lowner 

partial order induced by the cone 5+ (APB 4=^ A — B G 5+). 
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In the following A : Sn ^ is a linear operator and is the corre- 
sponding adjoint operator satisfying {A{X),y) = (X, for all X G Sn and 

y G K'”. The operators are of the form 









(A^,X) 



m 

and A^{y)=^y^A,. 

i=l 



We will denote a subset J C {1, • ■ • ,m} of the rows of A by Aj. Likewise we 
will speak of Aj and yj for some vector y G 

We consider semidefinite programs of the following form with J C {1 , . . . , m} 
and J = { 1 , . . . , m} \ J, 



max (C, X) 
s.t. Aj{X) = bj 
A.]{X) < bj 
X>0. 



min (6, y) 

(D) s.t. Z = A^{y)-C 
Z hO,y.j > 0 - 



Under a constraint qualification (which we tacitly assume to hold), any 
optimal solution X* of (P) and any optimal solution (y*,Z*) of (D) satisfy 
= 0 . 

We assume that there exists y G K'" with / = Aj(y), which is equivalent 
to the requirement that tr(X) = a = b'jijj for all feasible X. Most semidefinite 
relaxations of combinatorial optimization problems satisfy this requirement or 
can be scaled appropriately. To avoid the trivial case of X = 0, assume a > 0. 

Under these assumptions it is well known that (D) is equivalent to the eigen- 
value optimization problem (see e.g. [8]) 

min aAmax(C' - M^(y)) -h (6, y) . 

yeTR’^,yj>0 

To simplify notation we drop, without loss of generality, the coefficient a and 
assume that | J| = m, i.e., there are inequality constraints only. Thus we concen- 
trate on the following problem 

miny>o f{y) with f{y) = Amax(C' - M^(y)) -k (6, y) , (1) 

or equivalently min/5 := f + is with is being the indicator function of S' = M™ 
(*s(y) = 0 if y G S, 00 otherwise). 



3 The Function Amax(^) 

The maximal eigenvalue Amax(^) of a matrix is a nonsmooth convex function. 
The kinks of Amax(^) appear at points where the maximal eigenvalue has mul- 
tiplicity at least two. If, for some X G Sn, the columns of P G form an 
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orthonormal basis of the eigenspace of the maximal eigenvalue of X, then the 
subdifferential (the set of subgradients) of Aniax(-) at X is 



dXm..{X) = {PVP'^ : tr(l/) = 1,V e S+} 

= COnV : v'^Xv = Amax(-^), ||f II = l| ■ 



( 2 ) 



It is a well known fact that 

An.ax(^) = max {(kb, X) : tr(kb) = 1, kb ^ 0} . 

Any subset VV C (kb ^ 0 : tr(kb) = 1} gives rise to a non-polyhedral cutting 
plane model minorizing Amax(A'), 

Amax(A) > max {(kb, A): kb Gw}. (3) 

4 The Spectral Bundle Method with Inequality 
Constraints 

It follows from the properties of Amax(’) that (1) is a nonsmooth convex mini- 
mization problem. A standard approach to solve such problems is the proximal 
bundle method [7]. We first sketch the method in general and give the corre- 
sponding symbols that we will use for the spectral bundle method. Afterwards 
we specialize the algorithm to the eigenvalue problem ( 1 ). 

At iteration k the algorithm generates a test point = argmins/^ + 

III • — x^lp, where is an accumulated cutting plane model of / and the weight 
M > 0 keeps near the current iterate x^. A serious step occurs 

if /(a;^) — /(y^+^) > TOi[/(a;^) — /^(2/^+^)], where uil G (O, 5 ). Otherwise a null 
step = x^ is made but the new subgradient computed at y^^^ improves 

the next model. 

The spectral bundle method is tailored for the eigenvalue problem (1) in that, 
instead of the usual polyhedral /^, it uses a semidefinite cutting plane model of 
the form (cf. (3)) 



f^{y) = max^^g^*, (kb, C - A^{y)) + {b, y) . (4) 

We will specify the choice of and discuss the computation of in detail in 
Sect. 5. The usual bundle subproblem of finding 

= argminj,>o /'=(y) + |||y - a:'^^ (5) 

turns out to be rather expensive computationally because of the sign constraints 
on y. Instead we compute, by means of Lagrangian relaxation, an approximation 
to To this end we define the Lagrangian of (4)-(5) 

L{y, kb, y) = (C, W) + {b- A(W), y) -{yy) + ^\\y- x^\\^ 
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for W € and rj G M™. It has the form L(-;W,t]) = fw,rj{-) + f|| • — 
where 

f^^^(.) = {C,W) + {b-r,-A{W),-) 

is an affine function minorizing fg = f^ + is and hence fs- The dual function 
(p{W, rj) = min L(-; W, rj), computed by finding 

■ I / N '*^11 fcii2 fc b-rj-A{W) 

Jhy.r, = argmin/vy,^(y) + - y- a; =x , 

V 2 u 

has the form 

77) = (C, IT) + (6 - 77 - A{W),x>^) - 1||6 - 77 - ^(Vh)f . 

Denote by 77^ the multipliers of the previous iteration. We find an approximate 
maximizer T7^“''^) to (p over x IR™ by first computing 

Wk+i (z Arg max^g^*, (p{W, 77 '=) ( 6 ) 

via an interior point algorithm, and then setting 

77'=+! = argmax^>o , 1 )) = — umin |o, x^ — • (7) 

It is not difficult to check that 

:= yy^k+i^^k+i > 0 and = 0 - 

At the test point y^~^^ the function is evaluated and a new subgradient is com- 
puted. 

By the observations above, we have 

f = /u^^+y^^+7 <fs< fs- 

This and the following lemma motivate a stopping criterion of the form 
/(^fc)-/-fc+i(yfc+i)<^opt(|/(:r'=)| + l). 

Lemma 1. f{x^) > and if f{x^) = then x^ is optimal. 

Proof. Since f{x^) > and 

jfc+l( 3 .fe) _ Jk+l^yk+l^ ^ _ yk+l _ A(W'=+ 1 ), - /+!) 

= - 116 - 77 '=+! -A(W'=+i)f, 
u 

the inequality follows. If equality holds then f{x^) = /'=+!(a;'=) and 0 = 6 — 
77'=+! - A(IT'=+!) = V/'=+! G dfsi,x^). □ 
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For practical reasons we state the algorithm with an inner loop allowing for 
several repetitions of the two “coordinate wise” maximization steps ( 6 ) and (7). 



Algorithm 1. 

Input: G 1R+, £opt > 0, £m G (0jOo]> oti improvement parameter tol G 

( 0 , 5 ), a weight u > 0. 



1. (Initialization) Set k = Q, = y^, rf = 0, /(a;°) and W° (cf. Sect. 5). 

2. (Direction finding) Set rj'^ = r]^ . 

(a) Find W+ G Argmax^^yy^ ip{W,if^). 

(b) Compute vy'*' = argmax^>o ry). 

(c) Compute y'^ = yw+,r]+- 

(d) (Termination) If f{x^) — fw+ ^ £opt(|/(a^^)| + 1) then STOP. 



3. 



- fw+,v+{y^)] then go to (a). 



(e) If r{y^) - fw+,q+{y^) > £M{/(a;\ 

Set 77 '"+^ = ? 7 +, = W+, = y+ . 

(Evaluation) Compute f{y^'^^), 
find 1F|+i G Argmax^C- : tr(VF) = 1,VF^ O}. 

4 . (Model updating) Determine D conv (cf. Sect. 5). 

5. (Descent test) If /(a;^) — > WL[/(a;^) — then set = 

(serious step); otherwise set (null step). 

6. Increase k by 1 and go to Step 2. 



For £m = 00 the algorithm performs exactly one inner iteration as described 
before. This single inner iteration suffices to guarantee convergence. 

Theorem 1. Let Sopt = 0 and Sm = 00 . If Argmin/s yf 0 then x^ ^ x G 
argmin/ 5 , otherwise ||a;^|| ^ 00 . In both cases f{x^) i inf /s. 

The proof is rather technical and we refer the reader to the full version of the 
paper for details. To sketch the main idea only, assume that test point y^ has 
caused a null step. Then y^ is the (unconstrained) minimizer of the strongly 
convex function /^(•) + §ll'~a;^|P with modulus u and therefore, since G VV^, 

> L(/; VF^ 77^=) + . 

Likewise we obtain 

Using these relations and the fact that the subgradients generated by the algo- 
rithm remain locally bounded one can arrive at Theorem 1 along a similar line 
of arguments as given in [7]. 



5 VV^ and the Quadratic Semidefinite Subproblem 

Whereas the convergence analysis of the algorithm imposes only very few re- 
quirements on VV^ and its update to its actual realization is of utmost 

importance for the efficiency of the algorithm. The spectral bundle method uses 

Vi;'= = I W = P’^ViP'^f + aW'" : tr(U) -k a = 1, U G 5'+, a > o} , 
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^ 

where is some fixed matrix with orthonormal columns and W € 

satisfies tr{W ) = 1 . should span at least partially the eigenspace belonging 

^ 

to the largest eigenvalues of matrices C — A^{y) for y in the vicinity of a;^. W 
serves to aggregate subgradient information that cannot be represented within 
the set {^P^V {P^y : tr(y) = 1,V € 5'+}. The use of W ensures convergence 
of the spectral bundle method even if the number of columns of P^ is restricted 
to r = 1. The quality of the semidefinite model as well as the computation time 
for solving (6) depend heavily on r. 

The special structure of allows us to evaluate directly: 

f (y) = max{A„,ax((P")^(C - Al^(y))P"), (w\c - Al^(y))} + {b,y) . 

Observe that the eigenvalue computation involved is cheap since the argument 
is an r X r symmetric matrix. 

By the choice of for a fixed rj = 77^, (6) is equivalent to the quadratic 
semidefinite program 

min ^\\b-r]- A{P'^V{P'^Y + aW^)f 

- (p'^ViP'^y + C - - (6 - y, xy 

s.t. tr(y) + a = 1 
y ^ 0, Of > 0. 

Its optimal solution {V*,a*) gives rise to p a*W^ and 

is also used to update P^ and W in Step 4 of the algorithm. Let QAQ'^ be 
an eigenvalue decomposition of V*. Then the ‘important’ part of the spectrum 
of is spanned by the eigenvectors associated with the ‘large’ eigenvalues 

of V*. Thus the eigenvectors of Q are split into two parts Q = [Q1Q2] (with 
corresponding spectra A\ and 212), Qi containing as columns the eigenvectors 
associated to ‘large’ eigenvalues of V* and Q2 containing the remaining columns. 
P^+^ then contains an orthonormal basis of the columns of P^Qi and at least 
one eigenvector to the maximal eigenvalue of C — computed in Step 3 

of the algorithm. The next aggregate matrix is 

= a*w'' + P'^Q2A2Ql{P'^y /{a* + tr(Al2)). (9) 

This ensures G VV^+^. For VV° we choose to contain a set of or- 

thonormalized eigenvectors to large eigenvalues of C — Al(y°). We do not use the 
aggregate in the first iteration. 

(8) has to be solved for each update of ry in the inner loop or, if £m = 00, at 
least once per outer iteration. Thus the efficiency of the whole algorithm hinges 
on the speed of this computation. We briefly explain the most important issues 
that arise in this context. 

Using the svec-operator described in [12] to expand symmetric matrices 
from Sr into column vectors of length and by ignoring all constants, (8) 
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can be brought into the following form (recall that, for A,B G Sr, {A,B) = 
svec(A)^svec(i?) and that tr(y) = (/, V)) 

min isvec(y)^QiiSvec(y) + o(7^svec(T4) + \q 22 C(^ + cJ'svec(V) + 02 a 
s.t. a + sfsvec(V) = 1 (10) 

o: > 0, V y 0, 



where 



Qn = ^ Z™ 1 svec(P’^ AiP)svec(P’^ A^P)’^ 
qi2 = isvec{P^A'^{A{W))P) 
q22 = h{A(W),A(W)} 

Cl = —a svec(P^(i^^(6 — t]) + C — A^{y))P) 

C2 = -a{{l{h-y)-y,A{W)) + {C,W)) 

Si = svec(/). 

This problem has + l variables and can be solved quite efficiently by interior 
point methods if r is not too large, smaller than 25, say. Since convergence is 
guaranteed even for r = 1, it is possible to run this algorithm for problems with 
a huge number of constraints m. Several remarks are in order. 

First, it is not necessary to have W available as a matrix, it suffices to store 
the m- vector A{W) and the scalar (C, VF). These values are easily updated 
whenever W is changed. 

Second, almost all computations involve the projected matrices P"^ AiP. 
These have to be computed only once for each evaluation of (10), they are 
symmetric of size r x r, and only one such projected matrix has to be kept in 
memory if the values of Qn to C2 are accumulated. 

Third, the most expensive operation in computing the cost coefficients is the 
accumulation of Qu, which involves the summation of m dyadic products of 
vectors of size for a total of 0{mr^) operations. Even for rather small r 

but sufficiently large m, this operation takes longer than solving the reduced 
quadratic semidefinite program. 

Finally, changes in y do not affect the quadratic cost matrices Qu to (722, 
but only the linear cost coefficients c\ and C2. Within the inner loop it suffices 
to update the linear cost coefficients. The dominating cost of computing Qu 
can be avoided. If the inner iteration yields only small changes in 77 then the 
optimal solution will also change only slightly. This can be exploited in restarting 
strategies. 

We solve (10) by the primal-dual interior point code used in [6]. It maintains 
primal and dual feasibility throughout {U is the dual variable of V, (3 is the dual 
variable of a, and t is the dual variable to the primal equality constraint), 

Qiisvec(y) -I- aqi 2 + c\ — tsi — svec{U) = 0, (11) 

aq22 + qf2^vec{V) + C 2 - t - (3 = 0, (12) 

1 — a — sfsvec(E) = 0. (13) 
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We employ a restart procedure for reoptimizing the quadratic program in the 
inner loop. Let {U* ,V* , a* , P* ,t*) denote the optimal solution of the latest prob- 
lem, and let Z\ci and Z\c2 denote the update (caused by the change in 77) to be 
added to ci and C2- We determine a starting point (C/°, a°, /3°, by the 

following steps. First, note that (V,a) = (J, l)/(r + 1) is the analytic center 
of the primal feasible set. Therefore any point on the straight line segment 
-I- (1 — ^)(/, l)/(r + 1) with ^ S [0,1] is again primal feasible. Fur- 
thermore it can be expected to be a point close to the central path if the new 
optimal solution is relatively close to the old optimal solution. We determine ^ 

by ^ = min{. 99999, max[.9, ^1- 

In (11) and (12) the change from (V*,a*) to the starting point (F°,a°) 
determines the changes in the dual variables up to a diagonal shift that can be 
applied through t. This diagonal shift is chosen so that the changes in U and 
P are positive definite. Thus C/° (/3°) is strictly positive definite but not too 
far from U* (P*)- Experience shows that this restarting heuristic reduces the 
number of iterations to two thirds down to one half depending on the size of the 
changes in 77. 

6 Implementation and Combinatorial Applications 

The algorithm has been implemented as a direct extension of the algorithm in 
[6] and uses essentially the same parameter settings. In particular, ttil = 0.1, 
r = 25, £opt = 5 • 10“^, and the weight u is updated dynamically by safeguarded 
quadratic interpolation. We present some computational results for £m = -6 and 
£m = 00 . The choice of £m = -6 should keep the number of inner iterations 
small, at the same time avoiding null steps enforced by the bad quality of the 
relaxation. Our computational experiments were carried out on a Sun Sparc 
Ultra 1 with a Model 140 UltraSPARC CPU and 64 MB RAM. Computation 
times are given in hours:minutes:seconds. 

We apply the code to the max-cut instances Gi to G21 on 800 vertices 
from [6]. Graphs Gi to G5 are unweighted random graphs with a density of 
6% (approx. 19000 edges). Ge to Gio are the same graphs with random edge 
weights from { — 1, 1}. Gn to G13 are toroidal grids with random edge weights 
from {—1,1} (approx. 1200 edges). G14 to G17 are unweighted ‘almost’ planar 
graphs having as edge set the union of two (almost maximal) planar graphs (ap- 
prox. 4500 edges). Gig to G21 are the same almost planar graphs with random 
edge weights from (—1, 1}. In all cases the cost matrix G is the Laplace matrix 
of the graph divided by four, i.e., let A denote the (weighted) adjacency matrix 
of G, then G = (Diag(Ae) — A)/4. 

To the basic semidefinite relaxation (diag(A) = e) we add 1600 triangle 
inequalities. These were determined by our code from [5], which is a primal-dual 
interior point code. We applied the separation routine of this code to the optimal 
solution of the semidefinite relaxation without inequality constraints. We also 
used the code to solve the relaxation with the same triangle constraints and will 
compare these results to the spectral bundle method. 
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Table 1. Comparison of the spectral bundle and interior point approaches. 



Problem 


IP-sol 


spectral bundle 
1% 0.1% 


1 interior point 
1% 0.1% 


Gi 


12049.33 


19 


3:51 


2:16:14 


4:05:13 


G2 


12053.88 


20 


5:37 


2:15:57 


4:04:42 


Gs 


12048.56 


17 


4:13 


2:44:06 


4:06:09 


G4 


12071.59 


17 


3:11 


2:42:53 


4:04:20 


Gs 


12059.68 


15 


3:21 


3:39:56 


5:02:24 


Gs 


2615.55 


1:24 


14:47 


3:37:52 


4:59:34 


Gr 


2451.61 


1:28 


11:55 


3:37:48 


4:59:28 


Gs 


2463.68 


1:42 


19:19 


3:37:47 


4:59:27 


Gs 


2493.51 


1:38 


16:44 


3:38:45 


5:00:46 


Gio 


2447.79 


1:22 


12:17 


3:35:53 


4:57:31 


Gii 


623.49 


9:25 


54:55 


4:55:20 


6:10:01 


Gi2 


613.02 


9:32 


47:01 


4:57:09 


6:43:04 


Gi3 


636.45 


8:52 


39:00 


4:31:17 


6:46:56 


Gi4 


3181.35 


1:19 


10:13 


3:36:13 


5:24:18 


Gi5 


3161.89 


1:09 


11:51 


3:35:00 


5:22:31 


Gi6 


3164.95 


1:24 


12:55 


4:07:35 


5:57:37 


Gi7 


3161.78 


1:20 


12:32 


3:35:22 


5:23:04 


Gis 


1147.61 


4:10 


15:45 


4:30:04 


5:51:05 


Gis 


1064.57 


4:23 


17:17 


4:59:07 


6:47:54 


G 20 


1095.46 


4:31 


14:58 


4:30:13 


5:51:18 


G 21 


1087.89 


3:22 


12:40 


4:56:09 


6:16:55 



For the bundle algorithm the diagonal of C is removed. This does not change 
the problem because the diagonal elements of X are fixed to one. The offset 
e"^{Ae — diag(A))/4 is not used internally in the spectral bundle code. However, 
after each serious step we check externally whether the stopping criterion is 
satisfied if the offset is added. If so, we terminate the algorithm. As starting 
vector we choose the zero vector. 

For inequality constrained problems our spectral bundle code exhibits an 
even stronger tailing off effect than for equality constrained problems. To solve 
the relaxation exactly seems to be out of reach. However, within the first few 
iterations the objective value gets already very close to the optimal value. In 
cutting plane approaches it does not make sense to solve relaxations to optimality 
if much faster progress can be achieved by adding new inequalities. Therefore 
the quick initial convergence is a desirable property for these applications. 

In this spirit we give the time needed to get within 1% and 0.1% of the 
optimum in Table 1 for both the spectral bundle code with £m = .6 and the 
interior point code. The first column specifies the problem instance, the sec- 
ond (IP-sol) gives the optimal value of the relaxation with triangle inequalities. 
Columns three and five (four and six) display the times needed by both codes to 
get within 1% (0.1%, resp.) of the optimum. (The entries for the interior point 
code are estimates, since its output only gives the overall computation time.) 
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Table 2. Performance of the spectral bundle code for £opt = 5-10 ^ and £m = -6. 



Problem 


r 


/(*") 




total time eig_time[%] # calls # 


serious 


Gi 


12049.33 


12055.79 


5.4-10-" 


5:10 


62.9 


15 


11 


G2 


12053.88 


12057.64 


3.1-10“^ 


12:06 


53.99 


25 


16 


Gs 


12048.56 


12051.59 


2.5-10“'‘ 


13:15 


53.21 


27 


18 


Gi 


12071.59 


12078.58 


5.8-10“^ 


4:26 


61.28 


15 


11 


Gs 


12059.68 


12065.88 


5.M0-^ 


6:28 


56.96 


18 


13 


Ge 


2615.55 


2617.04 


5.7-10“^ 


26:17 


45.21 


49 


22 


Gt 


2451.61 


2453.27 


6.8-10“^ 


15:38 


45.95 


34 


21 


Gs 


2463.68 


2464.86 


4.8-10“^ 


31:33 


46.01 


59 


22 


Gg 


2493.51 


2494.79 


4.8-10“'‘ 


28:06 


45.37 


54 


22 


Gio 


2447.79 


2449.71 


7.8-10“^ 


15:20 


43.8 


35 


22 


Gii 


623.49 


623.95 


7.3-10-^ 


1:11:04 


68.64 


94 


30 


Gi2 


613.02 


613.61 


9.6-10“^ 


47:42 


78.13 


90 


35 


Gi3 


636.45 


637.03 




39:00 


79.74 


82 


35 


Gi4 


3181.35 


3182.53 


3.7-10-^ 


19:46 


64.67 


43 


29 


Gi5 


3161.89 


3163.24 


4.3-10“'‘ 


19:51 


66.16 


40 


27 


Gi6 


3164.95 


3165.98 


3.3-10“^ 


29:32 


64.73 


50 


28 


Gi7 


3161.78 


3163.52 


5.5-10-^ 


18:48 


67.55 


41 


28 


Gis 


1147.61 


1148.25 


5.6-10“^ 


21:54 


55.48 


48 


31 


Gi9 


1064.57 


1065.06 


4.6-10“'‘ 


22:33 


54.99 


48 


28 


G 20 


1095.46 


1095.94 


4.4-10“^ 


20:05 


56.02 


46 


29 


G 21 


1087.89 


1088.72 


7.6-10“^ 


14:48 


59.57 


40 


29 



The fast progress of the bundle code on examples Gi to G5 can be explained 
by the large offset value (9500) that is added externally. For the other classes 
the offset is around 0 for Gq to G13 and Gig to G21, being about 2300 for G14 
to G17. The eigenvalue structure of the solutions of Gio to G13 suggests that the 
rather poor performance of the algorithm on these problems is caused by a very 
ffat objective. 



Table 2 provides more detailed information on the overall performance of the 
spectral bundle code for £m = -6. Column /* refers to the value of the interior 
point solution, f{x^) gives the objective value at termination, is the 

relative accuracy with respect to /*, total time gives the computation time, 
eig-time is the percentage of time spent in the Lanczos code (eigenvalue and 
eigenvector computation), # calls counts the number of objective evaluations 
and # serious the number of serious steps. 



Somewhat surprisingly the spectral bundle method also yields reasonable 
results for £m = 00. The corresponding values are displayed in Table 3. 
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Table 3. Performance of the spectral bundle code for £opt = 5-10 ^ and £m = oo. 



Problem 


r 


/(*") 


friri-r 


total time eig_time[%] # calls 


serious 


Gi 


12049.33 


12055.79 


5.4-10”'‘ 


5:14 


62.42 


15 


11 


G2 


12053.88 


12057.31 


5.4-10“^ 


13:13 


54.22 


27 


16 


Gs 


12048.56 


12053.27 


3.9-10“^ 


11:54 


54.06 


25 


17 


Gi 


12071.59 


12078.58 


5.8-10“^ 


4:23 


61.98 


15 


11 


Gs 


12059.68 


12065.88 


5.M0-^ 


6:26 


57.51 


18 


13 


Ge 


2615.55 


2616.89 


1 

0 

16 


26:32 


45.79 


52 


24 


Gt 


2451.61 


2453.27 


6.8-10“^ 


15:14 


46.50 


34 


21 


Gs 


2463.68 


2464.86 


4.7-10“^ 


32:02 


44.59 


59 


22 


Gg 


2493.51 


2494.69 


4.7-10“'‘ 


30:14 


46.14 


58 


23 


Gio 


2447.79 


2449.51 


7.0-10“^ 


14:32 


44.84 


34 


22 


Gii 


623.49 


623.89 


6.4-10“^ 


1:30:05 


71.3 


113 


33 


Gi2 


613.02 


613.69 


1.1-10“® 


53:17 


74.8 


95 


33 


Gi3 


636.45 


637.05 


9.4-10“'‘ 


43:31 


79.1 


87 


36 


Gi4 


3181.35 


3182.68 


1 

0 


19:23 


64.49 


43 


28 


Gi5 


3161.89 


3163.18 


1 

0 


19:29 


66.47 


40 


27 


Gi6 


3164.95 


3165.98 


3.3-10“^ 


28:55 


65.42 


50 


28 


Gi7 


3161.78 


3163.46 


5.3-10-^ 


18:35 


67.89 


41 


28 


Gi8 


1147.61 


1148.28 


5.8-10“^ 


20:16 


58.31 


47 


30 


Gi9 


1064.57 


1065.04 


4.4-10“'‘ 


23:24 


59.69 


51 


28 


G20 


1095.46 


1095.94 


4.3-10“^ 


18:39 


59.34 


46 


28 


G2I 


1087.89 


1088.63 


6.8-10“^ 


15:09 


61.17 


42 


29 
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